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Abstract 

Weak gravitational lensing has several important effects on the cosmic microwave 



> 

Q"^ I background (CMB): it changes the CMB power spectra, induces non-Gaussianities, 

•/^ ' and generates a S-mode polarization signal that is an important source of confusion 

for the signal from primordial gravitational waves. The lensing signal can also be 
\^ I used to help constrain cosmological parameters and lensing mass distributions. We 

' review the origin and calculation of these effects. Topics include: lensing in General 

Relativity, the lensing potential, lensed temperature and polarization power spec- 
tra, implications for constraining inflation, non-Gaussian structure, reconstruction 
O ' of the lensing potential, delensing, sky curvature corrections, simulations, cosmo- 

+^ ' logical parameter estimation, cluster mass reconstruction, and moving lenses/dipole 

, lensing. 
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1 Introduction 



Observations of tlie cosmic microwave background (CMB) temperature and polarization are 
a powerful probe of the early universe and cosmological models. There are now convincing 
measurements from space, ground and balloons, and many future observations are planned at 
ever greater resolution and/or sensitivity^ . The basic theory of the CMB is well understood, 
with a robust prediction for acoustic oscillations in the primordial plasma giving rise to ^ 
characteristicpattern of peaks in the anisotropy power spectra (for reviews see e.g. Refs. 



cnaracteristic patti 



)). As ever better data becomes available, observations can resolve smaller 
effects including various non-linear signals that are important on small scales. One of the 
most important of these is weak lensing: the deflection of CMB photons coming from the 
last scattering surface by potential gradients along our line of sight (0; [l3 U; [13 The 
lensing has a quantitatively important effect on the temperature power spectrum, as well as 
introducing qualitatively new non-Gaussian and polarization signals. The lensing signal has 
the same frequency spectrum as the unlensed CMB, and hence cannot easily be distinguished. 
Understanding and modelling lensing will therefore be essential for the correct interpretation 
of near-future CMB data. 

This review is aimed at readers familiar with the basic theory of the unlensed CMB; however 
we shall start with a very brief overview of the relevant properties. We then describe why 
the effect of lensing is important, give order of magnitude estimates, and present a heuristic 
physical derivation of the deflection angle. We shall use the weak lensing approximation, so we 
discuss what this means and whether it is a valid approximation. We give a rigorous derivation 
of the deflection using General Relativity (GR) and discuss technical issues in Section |21 In 
Section 121 we describe how the main effects of lensing can be encapsulated into a single lensing 
potential, how this affects the CMB, and how to relate it to the matter power spectrum. 
In Section |3] we derive the important several-percent effect on the CMB temperature power 
spectrum due to lensing, and in Section El extend this to the polarization. We explain the 
importance of the 'i?-mode' polarization signal from lensing, in particular how it is a source 
of confusion with any primordial signal from gravitational waves. In addition to the B-mode 
signal, lensing also introduces a characteristic non-Gaussian signal in the CMB, as discussed 
in Section ini 

The extra information in the non-Gaussian structure of the lensed CMB sky actually provides 
enough information to learn something about the distribution of the matter causing the lens- 
ing. In Section [71 we discuss how the lensing potential can be reconstructed from observations 
of the lensed sky alone. If we know something about the lensing potential, this in principle 
allows us to 'delens' the sky to recover what the CMB would have looked like without lensing. 
We discuss methods for doing this in Section |H1 and in particular how the B-mode lensing 
signal can be subtracted to get at any residual interesting B-mode signal from primordial 
gravitational waves. 
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For most of this review we shall use the flat-sky approximation, as it greatly simplifies many of 
the derivations. In Section El we describe the small but potentially important corrections due 
to the fact that in reality observation directions have spherical geometry about our observa- 
tion point. In Sectional we describe the simulation of lensed CMB skies, essential for testing 
pipelines and assessing the importance of non-linear physics and other real-world complica- 
tions. Finally in Section ^2 we describe the current observational status and various possible 
applications of CMB lensing. These include using lensing to constrain dark energy, neutrino 
mass and other cosmological parameters, as well as learning about the properties of lensing 
clusters and galaxies. We also discuss the equivalent effects of moving lenses and dipole lensing. 



1.1 The unlensed CMB 



Observations show that the CMB temperature is approximately uniform at a temperature 
of 2.725K, with anisotropies at the (9(10~^) level The statistics of the observed CMB 
anisotropics can be explained as arising from acoustic oscillations in the primordial photon- 
baryon fluid, where the initial perturbations had a Gaussian, purely adiabatic, nearly scale- 
invariant spectrum 0). The small size of the perturbations means that linear perturbation 
theory is very accurate, and the acoustic oscillations can be modelled accurately to give 
robust predictions. What we observe is the epoch at which the universe became neutral at 
a redshift z ~ 10^, after which CMB photons can propagate more or less freely towards us. 
The perturbations on this high-redshift last-scattering surface are the main contribution to 
the CMB anisotropy observed, with additional large-scale anisotropies from the integrated 
Sachs- Wolfe f[l6h (ISW) effect from evolving potentials along the line of sight. The process of 
recombination is not instantaneous as it takes a while for most of the protons and electrons 
to combine into neutral hydrogen, though viewed today it is extremely thin compared to the 
distance to the last scattering surface. It is however slow enough for a photon quadrupole to 
develop, which then gives rise to polarization through Thomson scattering (17; 3; 3)- Thus 



the last scattering surface is expected to have an interesting polarization signal as well as an 
anisotropy in the CMB temperature. 



The late-time evolution of potentials gives an additional large-scale temperature anisotropy 
through the (late-) ISW effect, which means that the large scale CMB temperature is to some 
extent correlated with the large scale structure having the evolving potentials. Another im- 
portant late time effect is that of reionization: the epoch at which energetic photons from 
stars and quasars can reionize the neutral hydrogen gas. This is thought to have happened at 
redshifts somewhere between 6 < z < 20. The effect is important because the free electrons 
after reionization scatter some of the CMB photons, damping the small scale anisotropies. 
Furthermore the CMB quadrupole from last scattering will scatter from the free electrons 
giving rise to a polarization signal that can dominate the expected signal from the last scat- 
tering surface on large scales. Angular scales smaller than a couple of degrees are however only 
damped by the reionization: the small-scale CMB signal of most concern for CMB lensing is 
just proportional to the signal coming from the last scattering surface. The proportionality 
constant e"^"^ depends on the optical depth r to the last scattering surface. 
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There are irnportant second-order signals other than lensing, for example the Sunyaev-Zel'dovich 
(SZ) effect f)2Cl| ) (for reviews see e.g. Refs. (HJill)). These are generally generated at low red- 
shift, and are almost unaffected by lensing (|22ri. We shall therefore concentrate on the pro- 
cess of lensing of the linear CMB signal from the last scattering surface. Since the width of 
the last scattering surface is small (~ lOOMpc comoving) compared to its distance from us 
(~ 14000Mpc comoving), it is a good approximation to consider it as a single lens source 
plane. The effect of lensing on the reionization and ISW signals is generally very small as they 
are only important on large scales. 



1.2 CMB lensing order of magnitude 

Gravitational lensing of light by matter is expected even in Newtonian physics. If a particle 
passes a point mass at a safe distance, a simple Newtonian argument gives a deflection by an 
angle 2^>{R)/(? (assuming the deflection is small), where ^{R) is the potential at the point 
of closest approach on the undefiected path. As discussed below, a relativistic argument using 
General Relativity in fact gives twice this result. For weak lensing of the CMB we are interested 
in the deflection of a photon's direction as it travels from the last scattering surface to our 
observation point. On its way the photon will encounter various under- and over-densities, 
where in matter domination the potentials due to these perturbations are constant in the 
linear regime. The depth of the potentials is ~ 2 x 10~^, so we might expect each potential 
encountered to give a deflection 5(5 ~ 10~^. The characteristic size of potential wells given 
by the scale of the peak of the matter power spectrum is ~ SOOMpc (comoving), and the 
distance to last scattering is about 14000Mpc, so the number passed through is ~ 50. If the 
potentials are uncorrelated this would give an r.m.s. total deflection ~ 50^/^ x 10"^ ~ 7 x 10"^ 
corresponding to about ~ 2 arcminutes. We might therefore expect the lensing to become an 
order unity effect on the CMB at I > 3000. In fact the unlensed CMB has very little power on 
these scales due to damping, so lensing can dominate the observed power at / > 3000 in the 
absence of any other secondaries. Of course we have neglected several important effects here, 
including the correlation of the lensing clumps and the correct conversion factors between 
deflection angles and observed angles; however these order-of-magnitude results are about 
right. 

The deflection angles will be correlated over the sky by an angle given by the angular size of a 
characteristic potential, which is ~ 300/7000 ~ 2° for a potential mid- way to last scattering. 
So although the deflection angles are much smaller than the size of the degree-scale primary 
CMB acoustic peaks, they are correlated over a comparable scale. This means that the lensing 
can also have an important effect on the scale of the primary acoustic peaks. If a degree sized 
hot spot at last scattering is lensed, we expect the spot to appear larger or smaller by ~ 2', 
corresponding to a fractional change in size of ~ 2/60 ~ 3%. The lensing won't make a spot 
larger or smaller on average, but it will change the statistics of the size distribution. The 
distribution will be 3% wider, corresponding to a ~ 3% broadening on the acoustic peak: 
after lensing the size of the spot is less well defined on average. We therefore expect lensing to 
have percent level effects on the CMB spectrum on the scale of the primary acoustic peaks. 
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and to dominate the spectrum on arcminute scales. 



1.3 Weak-lensing deflection angle 



The idea that gravity can bend hght dates back over two hundred years ()23l ) . In this subsection 
we give some simple non-rigorous derivations to get the correct lowest order result for the 
deflection angle. In Section |2l we use General Relativity to give a more rigorous derivation and 
discuss higher-order corrections. 

Consider weak lensing of a photon with velocity v by a point of mass M within Newtonian 
theory. By weak lensing we mean that deflections due to the lensing are a small perturbations, 
so deflection angles etc. are small and may be treated accurately using first-order results. 
The acceleration due to the mass is given by the gradient of the potential causing a small 
transverse acceleration v± = — Vj.^E' = GMcosO/r'^, where 6 is the angle of the photon from 
the mass relative to its angle at distance of closest approach Rq. Integrating over the photon 
path for constant speed |v| = c gives a total deflection angle f±/|v| = 2GM / c^Rq, which is 
the standard Newtonian result. From now on we shall use natural units with c = 1. 

What changes in General Relativity? The idea that acceleration is due to a force holds in GR, 
with D^w = — V_l\1/ in the Newtonian gauge, where represents the covariant derivative 
along the photon world line. However to relate this local acceleration to a change in observed 
coordinate we also have to account for the curvature of space. The total effect is a local deflec- 
tion angle 6(3 = — 2(5xV_l\1/, where 6x is a small distance along the photon path. Essentially 
the GR result is the inertial Newtonian result plus an equal term from the effect of spacetime 
curvature. 

Since potentials are generally small (l^^l ^ 10^^; smaller on linear scales) the deflection angles 
are small, consistent with our small angle assumption. Now let's consider how this deflection 
affects the observed angle 6 of an object at comoving distance x*- Comoving distances are 
related to angles via the angular diameter distance /^(x)? where 

r sm{K^/^x) for K > 0, closed, 
fxix) = lx for = 0, flat, (1.1) 

[\K\-^/'^ smh{\K\^/^x) for < 0, open. 

The comoving distance that the source appears to have moved due to the lensing is, in the 
small angle approximation, fK{x*~x)^P = fxix*)^^ (see Fig.[T]). Solving for 69, the deflection 
due to the source at x is 

= fKix*-xW ^ jKix. - x) ^, 2) 

fxix*) Mx*) 

in the direction of V_l^. Adding up the deflections from all the potential gradients between 
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6/3 



Fig. 1. Weak lensing geometry for a source (the CMB) at comoving distance x* lensed by a potential 
^ at distance X) assuming a flat universe. The lensing deflection by an angle 6f3 changes the observed 
angle of the source by an angle 66. 

us and the source, we have a total deflection 



a. 



^* , fK{x*-x) 



fKix*] 



1.3) 



This is the main result that tells us the deflection angle in terms of the potential gradients 
along the line of sight. The quantity rjo — x is the conformal time at which the photon was at 
position xfi. It is only valid for weak lensing (small angles), and is only valid to lowest order 
in the potential. For a flat universe fxix* — x)/fK{x*) = ^~ x/x*- 

The lensed CMB temperature in a direction fi is given by the unlensed temperature in a 
deflected direction n', T(n) = T(n') = T(n + a). The derivative of the deflection angle deflnes 
a magniflcation matrix (see e.g. Ref. (0)) 



d_ 



■an 



^1 - K - 7i -72 + uj 
^ -72 - to 1 - K + 7i ^ 



;i-4) 



An inflnitesimal source with surface brightness I{h+6C,) at position about fi before lensing, 
becomes, after lensing, /(fi' + AS^). At lowest order the magniflcation of the intensity fi = 
\A\~^ = 1/[(1 — + — |7p] 1 + 2fi; is determined by the convergence, k = — |V ■ a. 
The shear 71 + ^72 determines the area-preserving distortion, and the antisymmetric piece u 
determines the rotation. Since Eq. ()1.3p is purely a derivative, the antisymmetric rotation u 
vanishes at lowest order. 
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Shear is the most important effect for galaxy lensing (j2J) as it makes the galaxy ellipticities 
correlated in a direction determined by the local shear. It can be used without knowing 
anything about the distribution of the source galaxies. However for the CMB we actually 
know a lot about the statistical distribution of the unlensed CMB, so shearing is not the only 
relevant effect: we can also learn information from perturbation-scale (i.e. non-infinitesimal) 
magnifications. Including shear and convergence effects on finite scales is most easily described 
as the remapping of points by the deflection angle. For this reason we shall usually discuss the 
deflection angle directly, though the shearing effect on the CMB is real and may be observable 
through changes to the hot and cold spot eUipticity distribution (,25; 2Q . 



1.4 Photon distribution 



The observed angle of a particular photon from the last scattering surface will be altered 
due to lensing by potential gradients along the line of sight. In general there will also be a 
perturbation to the frequency due to time-varying potentials. For linear perturbations this is 
just the integrated Sachs- Wolfe effect, in general it is the Rees-Sciama effect. By discussing 
only lensing we are explicitly not considering these frequency shifts, so they must be accounted 
for separately along with other secondary signals hke SZ. 

Lensing by transverse gradients does not change the frequency of the photons, and hence does 
not change the frequency distribution in a given direction except via the change in the angle 
of the source. Hence the lensed CMB has the same blackbody spectrum as the unlensed CMB, 
and multi-frequency observations cannot be used to separate the lensing signal. 

The number of photons arriving from a particular sized patch of the last scattering surface 
will be altered due to lensing: a magnifying lens along the line of sight will cause more photons 
to reach us from the patch than without lensing. However the angular size subtended by the 
patch is also increased proportionately, so the number of photons per unit solid angle remains 
the same. This is the rule that lensing conserves surface brightness. Since the frequency is also 
not altered by lensing, it means that lensing of an isotropic last scattering surface is completely 
unobservable: photons are simply moved around but arrive in the same distribution as before. 
It is only the presence of anisotropies on the last scattering surface that make the lensing 
effect interesting. 



1.5 Is weak lensing a good approximation? 



There are several different aspects of the weak lensing approximation, and 'weak' may be used 
to mean slightly different things in different contexts. The main criterion here is that deflection 
angles should be small, which is indeed the case with typical deflections being of the order of 
arcminutes. Deflections from non-linear structure are also small, with clusters typically giving 
angles of the order of an arcminute or less, and smaller structures such as galaxies giving only 
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arcsecond deflections. Only a minuscule fraction of lines of sight will come close to black holes 
or other dense bodies that violate the small-angle assumption. 

Small deflections are essential for the validity of the Born approximation: calculating the 
lensing effect using the potential gradients along the undeflected path. If deflection angles were 
large compared to the scale of lensing perturbations this would not be a good approximation 
and we would have to tackle the much harder problem of ray-tracing along the full deflected 
photon path. As it is, the Born app roximation is excellent for lensing on scales larger than a few 
arcminutes (see e.g. Refs. (|27[ Esf)). Lensing from a single cluster is also accurately modelled: 
the deflection occurs only in a very thin section of the photon path over which the deflected 
and undeflected paths are the same. Lensing from multiple clusters is more complicated (see 
Section f2.5|) . but rare and irrelevant for a CMB calculation on larger scales. On very small 
scales numerical simulations can account for the important additional effects fSectionllO|). 

Another use of the term weak lensing is to exclude cases of 'strong' lensing where lines of sight 
cross, giving caustics, multiple images and points of inflnite magniflcation and shear. This can 
happen even for tiny deflection angles if the source is small. For example massive clusters will 
have a CMB Einstein ring at a radius of ~ 1 arcminute from the centre, corresponding to 
the radius at which deflected rays meet at a point on the last scattering surface. Most of our 
analysis will in fact apply equally well to 'strong' lensing cases, all that we shall require is 
that deflection angles are small. 

Strong lensing and signiflcant magniflcations are in fact very common: a large fraction of lines 
of sight will have intersected other lines of sight by last scattering (jiit Isoi ). and many ray 
bundles will have experienced signiflcant (several percent) magniflcation due to encounters 
with small scale structures along the line of sight. Much of this effect comes from encoun- 
ters with very small sub-cluster scale structures, and implies that the light cone has a very 
complex structure on sub-arcminute scales. For the CMB this is fortunately not a problem. 
On arcminute scales the CMB is very smooth due to diffusion damping. Magniflcation of a 
smooth surface does not change what you see at all: if there is a galactic-scale area of large 
magniflcation of the CMB this will be essentially unobservable. To have an interesting effect 
we need arcminute-scale coherent magniflcation, corresponding to arcminute scale deflection 
angles. This is much larger than the amount of deflection produced by most of the struc- 
tures giving rise to large magniflcations. We can therefore proceed to calculate the effect on 
the CMB down to arcminute scales without worrying at all about small-scale lenses. What 
matters most for the CMB is the lensing effect of large-scale nearly-linear structures. 

Another possible worry is whether non-linear clumping in the late universe violates calcula- 
tions based on angular diameter distances and a low-order expansion about a homogeneous 
Friedmann-Robertson- Walker (FRW) universe. For example if two lines of sight happen to 
travel exclusively through voids, they will have diverged relative to the propagation expected 
in a flat FRW universe (|3^. However it turns out that on average the effect on angular 
diameter distances is cancelled by convergence caused by propagation through denser re- 



^ The contrary claim in Ref. (|3ir ) is erroneous. 
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gions ()33l : |3J; |29|; l35f l. Whether the variance is important has historically been the matter 
of much debate (see e.g. Refs. (fl^ IsgI: ll^). As discussed above the total magnification vari- 
ance may be large, but this variation comes largely from very small structures and is largely 
irrelevant for the CMB ^ : only a convergence field correlated on scales of many arcminutes 
can affect CMB acoustic scales. Also note that potentials are everywhere small, (< 10^'^ even 
including non-linear features), so we might expect that a systematic analysis based on linear 
theory that we present in later sections should be accurate to within fairly small non-linear 
corrections (Q (see also Refs. Jsl H H 13^)). 

We discuss higher-order corrections to the first-order lensing result in more detail in Sec- 
tion O 



2 Lensing in General Relativity 



To give a rigorous analysis of the lensing effect we must consider directly the propagation 
of light rays through the perturbed metric of spacetime. We do this in linearized form using 
a standard metric-based approach in Section 12.11 An alternative treatment based on the 
dynamics of infinitesimally-separated rays is given in Section \2.2\ which allows us to define 
the lensing defiection entirely in terms of directly observable quantities (in linear theory). We 

employ and assume familiarity with standard General Relativity with the (H ) signature 

and units such that c = 1. 



This section on General Relativity is mathematically more sophisticated than the rest of the 
review, and tackles some interesting conceptual and technical issues. However readers who are 
happy with Eq. ()1.3p can skip this section and proceed directly to Section El as the material 
in this section is not required for the remainder of the review. For more general reviews of 

gravitational lensing see Refs. (HHIIillii). 



2. 1 Perturbed photon paths 



We aim to compute the perturbed paths of light rays (null geodesies) in the real clumpy 
universe. In general, solving for the null geodesies requires numerical ray-tracing methods 
through large simulation volumes ([43). However, for CMB lensing on scales > 1 arcmin only 
weak lensing is expected to be important (see Section IT3|) in which case the perturbations of 
rays are small and a linearized treatment is appropriate. 

We only consider lensing by density perturbations [for lensing by gravitational waves, see 



Refs. (|4J;|45D], and for convenience we work in the conformal Newtonian gauge (for a discussion 



^ Ref. (ji^l claims a several-percent effect on the main CMB acoustic peaks; this is incorrect because 
the scale dependence and correlation of the magnification were not properly accounted for. 
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of gauge issues see Section |2T3|) . The line element is then 

ds^ = a\r])[{l + 2^N)dr/' - (1 + 2<l>^)j,,dx'dx^], (2.1) 

where rj is conformal time and the unperturbed spatial metric is such that 

7,,da;Mx^' = dx' + fMidO^ + sin^ ^d^^). (2.2) 

We shall work to first order in the scalar potentials $n and \E'n, so equalities below should be 
taken to apply at first order only. Since for lensing we are only calculating the path of null 
geodesies with ds^ = 0, we can equally well divide through by a^{ri){l + 2$n) and use the 
simpler conformally-related metric 

ds^ = (1 + A'^)dri^ - -fijdx'dx^ (2.3) 

Here we defined the 'Weyl potential' = (\E'n — '^'n)/2, so called because the general linear 
scalar-mode Weyl tensor can be expressed in terms of derivatives of \E' only. The conformal 
invariance of null geodesies means that they can only depend on the conformally-invariant 
Weyl part of the Riemann tensor. 

Affinely-parameterized null geodesies are solutions of the geodesic equation, 

d^xf" ^„ dx^dxP , , 

+ ri;,-^-^ = 0, (2.4) 



dA2 '^''dA dA 

with 5f^,^(dx^/dA)(dx'^/dA) = and ds^ = 0. Note that we have denoted the affine parameter 
in the conformal metric with an overhat since it necessarily differs from that in the original 
frame. In an unperturbed model, there are incoming radial solutions (with a focus at the origin 
at conformal time tjq) with X = Vo ~ V- 

The 0-component of the geodesic equation gives 

d^r? f dri\^ d'^ dri dx^ d'^ 

^ + 2M -^ + 2^^— = 0, 2.5 
dA2 \d\J d7] dA dA dx' ' 

where the derivative d'^/drj = (9^^^ + {dx'' / dri)di'^ is along the (perturbed) ray. Equation ()2.5|) 
is the only equation in this section that does not also apply in the original metric of Eq. ()2.ip 
since we cannot simply replace A by A. With Eq. (j2.5|) . we can eliminate the affine parameter 
A from (|2.4j) in favour of rj, giving 

dV ^dxWd^ dx^ :.d^ ,o^ -,• dx^' dx'= , , 

+ + 27^^ 7^ + ^'^r . — — = 0, (2.6) 



dry^ dr] I dr] dr] dx^ 1 dx^ ■'^ dr] dr] 



where ^^''f*;, are the connection coefficients of the unperturbed three geometry 'jij. It is very 
convenient to consider an observer located at the origin of the spatial coordinates, in which 
case we are interested in rays that focus at = 0. For such rays, dx/d?7 = — 1 + 0(\E') and 
dO/dr] = 0(\l/) with an equivalent result for 0. Making use of these results in Eq. ()2.6|) . and 
evaluating the background connection coefficients, we find 
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d^Y d\l/ 
— + 2 — 
dr/^ drj 

d^ _ ^ dln/x(x) ^ 2 
dr/2 dx dr] 90 

dV _ , dln/x(x) d0 ^ _^ i_d;l 

df]'^ dx df] /l'(x) sin^ 6* 90 

which determine the perturbed rays up to first order in 

Equation ()2.7|) has dx/d?7 + 2\& as a first integral, and this must equal —1 by the null condition 
for the perturbed ray. Integrating again, we find 

X = r]o~r]- 2 [\d7]', (2.10) 

where the integral is along the ray. Since we are only working to first-order in we can 
evaluate the integral along the unperturbed path 6 = const., = const, and X = Vo ~ V- 
(This is the Born approximation.) Integrating to a fixed t], Equation ()2.10|) implies a radial 
displacement. Alternatively, integrating to a fixed x implies a variation in the conformal time 
at emission; this is a time delay if the potential is negative on-average along the ray, i.e. 
it passes mostly through overdense regions. At the last-scattering surface, the r.m.s. radial 
delay is ~ 1 Mpc and is coherent over very large angular scales (j4a). Despite the delay being 
relatively large, the effect on the CMB is only small due to geometric suppression effects and 
the large coherence scale. If we think of plane-wave fluctuations, at a given angular scale 
the dominant plane waves to contribute to the CMB fluctuations have their wavevector 
perpendicular to the line of sight (and have magnitude k ~ l/fxix*) where fxix*) is the 
comoving angular-diameter distance to last scattering). A radial displacement has no effect 
on such plane waves since the displacement then lies in the wavefront. The largest effect of 
the radial displacement in the CMB power spectra is actually felt through its cross-correlation 
with the transverse lensing displacement, but is still well below 10~^ of the primary power for 
the temperature and polarization at / ~ 1000 (where its contribution peaks), and below 10"'^ 
for the cross-power between temperature and polarization (0). Since these effects are much 
smaller than those due to the transverse deflections, we shall not consider radial delays any 
further. 



0, 
0, 
0, 



(2.7) 
(2.8) 
(2.9) 



Equations ()2.8|) and ()2.9|) can be integrated twice back to a conformal time r/o — ?7* = X* using 
the zero-order result x = Vo~V since it only appears in the argument of a function multiplying 
first-order terms. Then, changing integration order, the integral over / /x^('7o ~ rj')drj' can be 
done using the explicit form in Eq. ()1.1|) . The result is (|47l ) 



Hvo - X* 



^ fK{x*)fK{x) de^^^''''^' 

^un; Vo - X), 



. fK{x*-x) 
lo ^ fK{x*)fK{x)^r,'edc^ 



(2.11) 
(2.12) 
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where 60 and 0o label the line of sight n. The small angular displacements 6 — 60 and — 0o 
form the components of the displacement vector a on the sphere. Noting that ae = 6 — 60, 
but = sin^ 9{(f) — (po), we find 

a = -2 r dx /^/^;7f\ v^M/(xn; Vo - x) (2.13) 

-'0 Jk{x*)Jk{x) 

where n is the covariant derivative on the sphere. This derivation of the deflection agrees with 
the heuristic derivation given earlier since we can identify the transverse derivative in Eq. ()1.3p 
with [l//;,(x)]Vfi. 

This result actually holds for any theory described in differential geometry with a metric 
conformally equivalent to Eq. ()2.ip . General Relativity specifically only enters through the 
relation between the Weyl potential and the stress-energy tensor, as determined by the Einstein 
equation. The result is the Poisson equation 

(A + 3ir/a2)^ = 47rG((fp + n) , (2.14) 

where 5p is the comoving total density perturbation (i.e. evaluated in the rest-frame of the 
total energy), 11 is the anisotropic stress, and A is the three-dimensional Laplacian. In matter 
(and dark energy) domination where lensing is important the anisotropic stress is small and 
the Weyl potential is then directly related to the comoving matter perturbations. 



2.2 Geodesic deviation 



An alternative way to describe gravitational lensing concentrates on the gravitational distor 
tion of the invariant cross-section of infinitesimal bundles of light rays (48; 4^ 3^ 24; 5^ 2 



511:1401). The geodesic deviation equation can be used to follow the evolution in the separation 
of nearby rays. The advantage of this approach is that it deals exclusively with physically- 
observable quantities; the disadvantage is that by not following rays at finite separation it is 
not straightforward to characterize the global properties of the light cone. This means that 
some important properties of strong-lensing situations (i.e. where there is multiple covering of 
the wave front) such as the number of images a source forms, are not readily calculable. How- 
ever, here we are largely interested in weak lensing and in this limit the approach is essentially 
equivalent to that given above based on the perturbed geodesies. 

We shall express the propagation of the cross-section of the bundle of rays from the 1+3- 
covariant viewpoint of a field of observers with 4-velocity u"". Working in the geometric-optics 
limit, we consider two null geodesies x°'{X) and x"(A) + 5x"(A) that lie in the past light cone 
of some event A (so all generators of the light cone focus at A) . Here A is an affine parameter, 
which we assume takes the same value at A for all rays there. In addition, we chose A such that 
Madx"/dA varies smoothly across the geodesies at A. For rays with infinitesimal separation, 
the connecting vector ^"(A) = 5x°'{X) is then also infinitesimal and lies in the null surface. 
This means that ^""ka = where k"' = dx"'/dX is the wavevector. The equation of motion of 
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the connecting vector is D^"/DA = k'^Vb^"' = ^^Vfofc" where the right-hand side involves the 
(covariant) difference of the wavevectors at A on the two geodesies. This follows since, in some 
arbitrary coordinate chart, 



B^^' dC^ 



DA 



^k^'{x + 5x) - k^ix) + V.pCk" 

^Cd.k^ + r^.Ck" 



(2.15) 



We now introduce the projected connecting vector ^" that connects the two geodesies in the 
local rest space of an observer with velocity u"', i.e. ^±Ua — 0. In general, will differ from 
^" by a vector proportional to k"" (a displacement along the geodesic): 

f ■ V 

C±-C- j—k^- (2.16) 
This can be written in terms of the screen projection tensor, 

nj — I I kahj c^'^iakh) , _x 

riab = 9ab - UaUb + CaCfe = Qab + TT ^ " (2.17) 

where /c" — {k-u){u"'+e"') with the (unit) direction e" perpendicular to it", as — 'H'^bi^- The 
screen-projection tensor projects perpendicular to both and e", so it projects into the two- 
dimensional rest-space of -u" that is perpendicular to the spatial direction of the ray. Under a 
(local) change of velocity -u" ^ -u", the new projected connecting vector is obtained from that 
with respect to u"' by applying the new screen projection tensor: CI CI = ^"ftC^ = ^"ftCi. 
Since C" and ^1 differ only by a (null) vector parallel to A;", dot products between projected 
connected vectors for neighbouring rays are independent of the 4- velocity and so the shape 
and size of the cross-section of a ray bundle are observer- independent. 

The propagation equation for C" follows from that for we find 

" ^2.18) 

where we have used k'^'Vbk"' — 0. The right-hand side involves the fully projected (2-dimensional) 
tensor Tia'^Tib'^V ckd- This is symmetric since, in the geometric optics approximation, ka = 
where S is the phase of the radiation field. The symmetric tensor can be decomposed irre- 
ducibly into a trace part and a trace-free part, 

na'-Hb^V^ckd) = e-Hab - (Tab, (2.19) 

where 9 = Vka defines the rate of expansion of the beam cross-section, and the trace-free 
shear defines the area-preserving distortion. These identifications follow from substituting 
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Eq. into Eq. (ITTH|) : 

nab^ = e^^a-aabe±- (2-20) 

The expansion 6 is thus half the fractional rate of change of invariant area of the ray bundle. 
If we differentiate Eq. fl2.18|) again and use the Ricci identity, 2V[aV6]/cc = Rabc'kd, we obtain 
the geodesic deviation equation for the projected connected vector in 1+3-covariant form: 

D / D^'' \ 1 

Ha'^ [^cb^ \ = na'e±k''k''Rdbce = HaXxk^k^Cdbce ' -C±aRbck' k'' . (2.21) 

In the last equality, we have decomposed the Riemann tensor in the source term into a part that 
depends on the Weyl tensor, Cabcd, and a part that depends on the Ricci tensor. Rat- The former 
describes the tidal gravitational action of nearby matter, while the latter is only operative 
where there is matter (including dark energy) present by the Einstein field equations. The ten 
degrees of freedom in the Weyl tensor can be represented by two projected (perpendicular to 
u"') symmetric trace-free (PSTF) tensors E^b ('electric' part) and Hat ('magnetic' part): 

Eab = W^U^Cacbd 5 Hab = -^acdCbe'^U^ , (2.22) 

where eabc = VabcdU'^ is the projected alternating tensor. Writing the Weyl term in Eq. ()2.2ip 
in terms of the electric and magnetic parts, we find 

Ha'— [Hcb^j = -2{k ■ ufeA^i^a'Hb)''E,, - n^a%)''Hj - -UaRbck'k', (2.23) 

where eab = ^abc^'^ and the angle brackets denote the PSTF part taken in the 2-dimensional 
space defined by the projection Hab- 
it is convenient to express in terms of a two-dimensional ortho normal spacelike basis along 
the ray, and E2, where Ti."'bE\ = Ef, I = 1,2. If we transport these vectors according to 
TiabDiJ^/DA = (i.e. as parallel as the projection properties allow), Eq. (j2.21|) can be written 
as 

^ = T,Aj, (2.24) 

where ^/ = —Ef$,^a and the (symmetric) optical tidal matrix is Tjj = —EjEjk"'k'''Rabcd- 
Given the initial condition that = at the focus A, the solution of Eq. (|2.24j) can be 
written as (|38[ ) 

0(A) = -ik- u)a^Vij{\) ^ = Vu{\)5ej, (2.25) 

dA ^ 

where the Jacobi map satisfies 

= TikVkj, (2.26) 

with the initial condition Vij\a = and dP/j/dAI^ = —{k ■ u)a5ij. In Eq. ()2.25p . we have 
introduced the (screen) components SOj of the infinitesimal angular separation of the rays 
at A] these are equal to —{k- u)']^(i^i / dX\A where the proportionality constant arises from 
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converting dA to a proper-time interval. The optical expansion and shear can be determined 
from the Jacobi map (except at caustics where T>ij is not invertible) by 



-^V],^j = e5u + aij, (2.27) 

where we have used Eq. ()2.2()j) . We shall see shortly that for weak lensing in a linearly- 
perturbed Robertson- Walker geometry we can relate the symmetric trace-free (STF) part of 
the Jacobi map to the lensing shear, and the trace to the convergence. 

We end this subsection by considering the effect of a change of 4-velocity u"" on the Jacobi 
map. If we change u"^ ^ u"', then, given the basis Ef (appropriate to m") along the ray, the 
basis Ej = H"'bEj is appropriate to u°' since it satisfies 7iab^E'}/DX = 0. Under both changes, 
the components of are invariant but the components of the angular separation of rays at A 
transform to 69 j = {k ■ u/k ■ u)aSOj. It follows that the Jacobi map must scale so as to leave 
■ u)a independent of u"". This behaviour is seen to be consistent with its differential 
equation on noting that Tjj is invariant but the initial condition on dVjj/dX scales with 
k ■ u\a- Finally, we note that the components of the optical expansion and shear are invariant; 
in 1+3-covariant form the shear transforms as aab ^— > d'ab = 'Ha'^7ib'^<Jcd, which leaves the square 
(JabO'"'^ invariant. 



2.2.1 Jacobi map in linearly-perturbed cosmology 

The symmetry of the background Robertson- Walker model demands that the Jacobi map 
must be cx 6ij. The determinant of the map is the (invariant) area of the ray bundle per unit 
solid angle at the focus A and so 

Vjj{X) = a{\)fK{x)Sij, (2.28) 

where a(A) is the scale factor at affine parameter A and x is the comoving distance the 
ray then travels to A. The trace-free part of Vjj thus vanishes in the background and so 
provides a gauge-invariant description of gravitational lensing in perturbed Robertson- Walker 
cosmologies. For weak lensing, the STF part describes the shear of images and can be related 
to V(aQ;b), where is the deflection angle, as we show shortly. The antisymmetric part of 
Vjj describes rotation of images; it vanishes to linear order in the perturbations by virtue of 
Eq. fj2.26|) . the initial conditions, and the fact that the optical tidal matrix is symmetric and, 
at zero-order, proportional to the identity matrix. 

The STF part, of the Jacobi map evolves according to the STF part of Eq. ()2.26p . To 

linear order in the perturbations, we find 

^A^ - afK{x) dA^ = «/i^(x) V), (2.29) 

where we have used Eq. ()2.28|) and the fact that the STF part of the optical tidal matrix, T^j^ 
vanishes in the background (it depends only on the Weyl curvature). For scalar perturbations. 
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the magnetic part of the Weyl tensor vanishes and the electric part can be written as a 
gradient term: Eab = —D(a^b)'^, where the derivatives here are projected perpendicular to 
The Weyl tensor is gauge-invariant and is generated from the sum of the metric perturbations 
in the conformal Newtonian gauge: \l/ = (\I/n — $n)/2. Expressing the Weyl tensor in terms 
of we find 

7^/j)=2(A;-m)2E|^,E^^E,, = -2(^-|-^^ V(/Vj)^, (2.30) 

where V(/Vj)\l/ involves the components of the (spherical) covariant derivatives of the pro- 
jection of \1/ at A along the line of sight onto the celestial sphere. It is convenient to define 
7/j = X'(/j)/[a/i^(x)] (or, more generally, j)/ v^detP, which is independent of m°). Taking 
the derivative, and using the linearized form of Eq. (j2.27|) . we can relate 7/j to the optical 
shear: d77j/dA = ajj. If we substitute for in Eq. (j2.29|) . we find 

where we have switched to conformal time using adt] = dx"'Ua = {k ■ u)dX and used the 



zero-order result x = Va — V- ^'^ follows that near to the focus, ~ 0{\a — A) ()38f ) and 

7/J ~ 0{\a — A)^. With this boundary condition, the solution of Eq. ()2.31|) is 

7,j(n) = -2 r dx /''/^;7f\ V(,Vj)Vl/(xn; r/^ - x), (2.32) 

where we have integrated back to x*- Comparison with Eq. ()2.13j) shows that 7/j = V(/aj). 

We now turn to the trace of the Jacobi map, which we denote by P = J^i^^ii- This evolves 
as 

^ = -l-VRabk'^k' (2.33) 
dA"^ 2 

to first order. Note that V does not vanish in the background but instead is given by = 
2a(A)/A-(x), ciiid the Ricci term in Eq. ()2.33|) also has a zero-order part. It is very convenient 
to remove that zero-order part of the Ricci tensor due to the expansion by working in the 
Newtonian gauge and performing the same conformal transformation that we made in Sec. 12. II 
9ab ^ dab = ^"^Qabi whcrc the conformal factor Vt^ = [a~^(l— 2$n)]. The conformal Ricci tensor 
has non-zero components 

^oo = 2D^Di^ , A, = 2i^7,, - 2DiD,^, (2.34) 

where Di is the three-dimensional covariant derivative constructed from 7,^- and D* = Y^Dj. 

Under a conformal transformation, ka = VaS* is invariant so that k"- = Vt~'^k'^ and so, with 
fc" = dx"'/d\^ d\ = f2^dA. For an observer following a path x"(r), the conformal 4- velocity is 
= Q~^u"' and the conformal increment in proper time is df = Qdr. The Jacobi map is a 
ratio of proper distances to angles so, for the same observer, the trace V = QV where Q is 
evaluated at the source position. 
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The conformal trace evolves according to the 'hatted' version of Eq. ()2.33|) . where, for an 
observer at the origin, the Ricci source term is 

= 2{k . uf [k + Vl^ + 2^^^^^!^) , (2.35) 

and Vi = /^^(x)V^V/ is the (comoving) transverse Laplacian. We define the first order 

quantity k = 1 — 'D/[2fK{x)] where x is the perturbed radial distance, so that Eq. ()2.33|) 
becomes 

[2Mx).]-2^IM = vCk-ur(K + Vl^ + 2^^^^^^^ (2.36) 



dA2 ' ''^^ ' dA2 V dx dx, 

We now change variables from A to x- To first order we have dx/dA = —{k-u) from the null 
constraint on /c", and the x component of the geodesic equation gives 

^^-(..«mx)(/<-.2^|). (2.37) 

Using these results and substituting V = 2/x(x)(l ~ i^), Eq. ()2.36|) simplifies to 



d^ 



dx^ 

which can be rearranged as 



I/j<-(x)k1 = Mx) (Vi* - A'k) , (2,38) 



^(/i(x)|)^V'V,*. (2,39) 

Near the focus at A, V = 2x + C^(x^) so that k = C(x)- Integrating twice with a switch of 
order, and using the boundary conditions at A, we get 



K n 



r /^/^; f\y Wj^{xn; VA-X)- (2.40) 
•^0 Jk{x*)Jk{x) 



Since f) = VlV in the non-conformal frame we can define the convergence in terms of the trace 
of the Jacobi map at first order as 

^^(n) 

2/x(x*)'^(x*n;r/*) 

where S'(x*n; r/*) = a{r]^:)[l + $N(x*n; r]^)] measures the perturbed scale factor at the source. 
Note that the x* here in /x(x*) must be the perturbed value, including the radial displacement 
due to the time delay. The convergence is measuring only the effect on the angular area due 
to transverse gradients rather than including the effect of the perturbed volume element at 
the source or the change in distance relative to the background due to the time delay. 

This completes our derivation of the shear and convergence from the Jacobi map in linearly- 
perturbed Robertson- Walker cosmologies. It is straightforward to show that the result is con- 
sistent with that implied by the lensing deflection angle and radial delays derived in the metric 
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approach in Section ITTl Note that derivations in e.g. Refs. ()48l : ISOl) appear to have neglected 
that time delay and 1 + terms required for a consistent definition of the magnification 
matrix in terms of the Jacobi map to first order. 

2.2.2 Lensing deflection angle 

The Jacobi map is directly observable and, as we have seen, for weak lensing the STF part 
can be related to the derivatives of the lensing defiection angle, where the latter is calculated 
by ray-tracing in the conformal Newtonian gauge. This raises the question, can we determine 
oc directly from the observable 77J for weak lensing, i.e. can we solve 

lab = ^(aab) (2.42) 

on the sphere? Any vector field can be written as a gradient and a divergence-free vector: 
tta = VaV' + Ca^VfeX; wherc ip ^ud X are scalar functions on the sphere and eat is the alternating 
tensor on the sphere. Substituting in Eq. ()2.42|) . we have 

7ab = ^(a^b)i^-e\a^b)^cX- (2-43) 

Taking the divergence and then taking the divergence or curl again, we find 

V»VS,, = ^V2(V2 + 2)^, e'^^VeVV = ^V2(V2 + 2)x. (2.44) 

On the full sphere, the (regular) kernel of V^(V^ + 2) is four-fold degenerate and consists of 
the / = and / = 1 spherical harmonics. It follows that the defiection field is determined 
up to a dipole vector by the observed jab- For the undetermined dipole vector, the curl-like 
part corresponds to a global (rigid) rotation and hence is unobservable. The / = 1 gradient 
part does not contribute to the lensing shear, but does contribute to the convergence given by 
K, = — V"aa/2 = —V'^ip/2. From Eqs. ()2.40|) and ()2.32j) we can see that this is consistent with 
our definition of k in terms of the Jacobi map given in Eq. (j2.4H) for / > 1. Since is only 
degenerate for the monopole, the observed convergence can be inverted to recover the dipole of 
the lensing potential that we cannot get from the shear. Note that the measured convergence 
and hence recovered dipole of the lensing potential and defiection angle do depend on the 
velocity of the observer, as discussed further below. 

2.3 Gauge dependence and observable dipoles 

It is important to understand how the observables depend on the velocity of the observer, v. 
First, consider the unlensed CMB. The main (9(v/c) effect is the generation of a significant 
dipole from the monopole. At next order, (9(v/c)(9(AB/0), there are additional effects due 
to the angular aberration of the anisotropics due to the change of frame: there will only be one 
choice of v for which there is no aberration. A boost from this frame will produce a dipole- 
like rotation of directions on the sky, giving an effective magnification and demagnification in 
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opposite directions, leading to the spacing of the acoustic peaks to be different on opposite 
hemispheres. In addition (9((v/c)^) relativistic aberration of the monopole will give rise to 
a frequency dependent kinematic quadrupole (and higher moments), though this will not 
concern us here (see Refs. [52; .53,) for more details). The frame in which the dipole vanishes 
will not in general be the same as the frame in which there is no aberration due to the presence 
of a primordial dipole contribution, and hence the difference is a gauge-invariant observable. 

In linear theory, the observed dipole can be expressed in terms of a gauge-invariant source 
at last scattering, a gauge-invariant ISW contribution, and a local term that depends on the 
shear. Hence in the frame in which the shear vanishes, the observed dipole will be just the 
cosmologically interesting gauge-invariant contributions. The shear vanishes in the Newtonian 
gauge, and hence the Newtonian gauge is the the one in which observers see no local kinematic 
dipole. 

Now consider the effect of lensing. A dipole lensing convergence has the same effect at lowest 
order as angular aberration due to local motion, magnifying and demagnifying in opposite 
hemispheres. Our calculations of the observed angles of geodesies must include any first-order 
angular aberration effect. The explicit results we calculated in the conformal Newtonian gauge 
show that the Jacobi map contains only terms evaluated at the source or integrated along the 
line of sight. This suggests that that there can be no local kinematic angular aberration 
for a Newtonian gauge observer, consistent with the result above that the kinematic dipole 
vanishes in this frame. Newtonian gauge observers see no local kinematic dipole and no angular 
kinematic aberration. 

For theoretical work it is very convenient to work in the frame in which there is no local 
angular aberration. This is the gauge in which the shear vanishes: the Newtonian gauge. This 
is why we have consistently used the Newtonian gauge, and will continue to do so. In this gauge 
$7V determines the perturbation to the 3-curvature and the integrated local 3-expansion; 
determines the acceleration; their difference determines the first-order gauge-invariant Weyl 
tensor, and their sum depends o„ the anisotropic stress (see e.g. the appendix of Ref. &). 

Our actual motion with respect to the CMB will be somewhat non-linear, and the observed 
dipole and angles need to be corrected to correspond to the Newtonian gauge predictions. 
The non-linear effects can be taken out easily by boosting to some frame in which the total 
observed dipole is zero or small (|55l ) . In principle we could then calculate the Newtonian gauge 
dipole by determining the local shear of the new local spatial hypersurface and calculating the 
transformation required so that it vanishes. In practice we probably cannot do better than 
boosting to the frame in which the total dipole is zero, which will differ from the Newtonian 
gauge by ~ (9(10~^) assuming the primordial Newtonian gauge dipole has the expected ampli- 
tude. We should then be able to measure the Newtonian gauge convergence dipole to an error 
of ~ O{10~^), which should be useful since it is expected to be 10~^-10~^: the convergence 
dipole should be measurable to an error much less than cosmic variance. 

In summary, the lensing effect of the CMB is real and observable. Only the dipole convergence 
is sensitive to the gauge, and the Newtonian gauge dipole convergence is measurable to good 
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accuracy in practice. Changing frame merely correspond to adding some angular aberration 
to the Newtonian gauge result. 

2.3.1 Other gauge effects 

It is worth commenting on the other potential gauge issues that arise in a calculation of 
the weak lensing effect on the CMB. We are interested in that part of the higher-order CMB 
anisotropy that arises (approximately) from the projection of fluctuations over a spacelike last- 
scattering surface. Consider making a consistent second-order calculation of this projection. 
The possible second-order effects are then: (i) the projection of second-order corrections to the 
source terms (e.g. photon energy and momentum density) evaluated over the zero-order last- 
scattering surface with the zero-order light cone; (ii) the projection of the first-order sources 
over the first-order perturbed last-scattering surface with the zero-order light cone; and (iii) 
the difference in the projection of the first-order sources over the zero-order last-scattering 
surface when the first-order perturbed light cone is used instead of the zero-order one. The 
sum of all three effects is gauge-invariant but the individual effects are not separately since they 
depend on how we identify points in the perturbed universe with those in the homogeneous 
background (on which we think of the perturbations as sitting). This identification defines 
a choice of gauge. As an extreme example, consider fixing the gauge so that for our given 
observation point, the zero-order light cone coincides with the real perturbed one^. Then 
effect (iii) vanishes, which is what we would naively think of as the lensing effect on the CMB. 
The light cone still has an observable perturbed geometry, but in this gauge it arises only 
from the local metric fluctuations rather than as an integrated effect. Note that the gauge 
transformation to this observational gauge is non-local so these two viewpoints are consistent. 
In the observational gauge, CMB lensing appears in effect (i) above since, under the gauge 
transformation, second-order source terms — that depend non-locally on the fluctuations 
along the line of sight — appear from displacing the first-order source fields by first-order 
amounts in the background. 

In the generalization of the conformal Newtonian gauge to second-order (or any other locally- 
defined gauge), the second-order fluctuations are sourced by non-linear physics in the prior 
evolution and any additional primordial contribution. Of the three secondary effects identified 
above, (iii) is expected to dominate though this has not yet been proved rigorously with a 
complete second-order calculation. It is this effect that we have computed here and what we 
call the lensing effect on the CMB. 

2.4 Lensing of polarization 

While CMB photons propagate freely from the last scattering surface, their number density in 
phase space (i.e. the one-particle distribution function) is conserved. In terms of the specific 
intensity /j, seen by a field of observers with velocity u"', the conserved phase space density 

^ This can be achieved by adopting the observational coordinates of Ref. (;56i ) and references therein 
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implies that lu/v^ remains constant ()57ri. where v is the photon frequency. The polarized 
specific intensity evolves in exactly the same way provided that it is measured on a basis Ej 
that evolves along the geodesic as ?iafeDi?j/DA = (j58lf). If we trace a null geodesic back to 
last scattering in the perturbed universe, there are several geometric effects that alter the 
observed polarization from that in the absence of lensing. First, as with the temperature 
(or total intensity), we are probing fluctuations on the last scattering surface at a different 
position. The transverse displacement is what is usually referred to as the gravitational lensing 
effect and dominates over the effect of the radial displacement Second, the propagation 
direction of the perturbed geodesic when it reaches last scattering differs from that of a 
photon propagating along an unperturbed, but displaced, line of sight n + a. The difference 
arises because the propagation direction accumulates only the lensing deflection angles but 
the lensing displacement a. weights the deflections by the ratio of angular diameter distances 
from lens to source and observer to source. The difference is of the same order as the deflection 
angle (~ few arcmin) but is much smaller than the angular scale of the polarization at last 
scattering which is quadrupolar and so should not be important (though detailed calculations 
of this effect have not been reported in the literature). Note that the same process affects the 
Doppler contribution to the temperature anisotropies. The flnal geometric process to consider 
is that in propagating the polarization basis along the perturbed line of sight n, the resulting 
vectors at last scattering will be rotated relative to an unlensed basis transported along n + CK. 
Provided we obtain the basis at n + ck by parallel transporting that at n along the spherical 
geodesic connecting the two points, the rotation angle will be ~ 1 arcmin and the effect should 
also be negligible. In summary, to a good approximation, the lensed polarization along n is 
the same as the unlensed polarization that would be observed along n + o; on bases that 
are related by parallel transport between the two lines of sight. We describe lensing of CMB 
polarization in more detail in Sectional 



2.5 Higher-order lensing 

The lowest weak-lensing approximation gives the deflection angle to flrst order in the Weyl 
potential The unlensed CMB is assumed to be flrst order, so the lensed CMB B(n) = 
G(n + ck) is in a certain sense second order: it is linear perturbations at last scattering lensed 
by linear perturbations along the line of sight. However B(n + o:) = G(n) + a°Va0(n) + . . . , 
with equality only to lowest order in \1/ (second order in total). It therefore differs from 
a second-order perturbative expansion of O by higher order terms. As we shall discuss in 
Section EJ these higher order terms are in fact not negligible. To get an accurate result from a 
perturbative expansion in ol one would need to go to at least third order and probably higher. 
Of course a fully consistent second-order analysis would also pick up additional physical effects 
(for example second-order kinetic-SZ and the small effect of second-order corrections to the 
sources at last scattering). However if these corrections are small (or can be added separately), 
the present analysis is likely to be much more accurate (and certainly very much easier) than 
evolving the Boltzmann equation to second or higher order. 

As we discuss in Section 13. 3| late-time non-linear evolution of \1/ can be included within 
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the basic linear-\l/ framework by simply replacing \Ef with its non-linear form. However a 
full analysis to higher order in \E' must go beyond the Born approximation, accounting for 
deviations of the actual photon path from the undefiected path and multiple scattering. 

In the first-order approximation, the deflection angles have zero curl and can be described 
as the gradient of a potential, cx = Vtp. However at next order in this is no longer true, 
and in general a has a curl-component, an effect called field rotation. This is represented 
by the rotation u in the magnification matrix, Eq. ()1.4|) . This lens-lens effect appears in the 
second-order Born approximation from scattering from two sources (hence (9(\1'^)). To see 
this consider applying the magnification matrix of Eq. (jl.4p consecutively for two different 
sources with = for each source individually. The resultant magnification matrix is given 
by the product of the two matrices, which is now no longer symmetric in general, and hence 
the overall effect includes some rotation. In addition to introducing an antisymmetric term in 
the magnification matrix, u, the curl of the deflection angle also means that the shear now 
has a curl component (Eq. ()2.43p ). This is the B-mode shear often used as a diagnostic in 
galaxy weak lensing surveys because it is expected to appear only at second order. 



The power spectrum and effect of the field rotation have been calculated in Refs. (|43l : |27|). 
Because it only appears at second order its power spectrum is fourth order in the potential. It 
is around 10^^-10^^ times smaller than the convergence power spectrum, with most effect on 
small scales. It is therefore likely to be much less important than uncertainty in the non-linear 
potential evolution that gives a much larger effect on these scales. In principle the field rotation 
does provide a limit to how well the i?-mode lensing signal can be subtracted (see Section |H1), 
but is not likely to be important at noise level s > 0.25/iK-arcmin (0). Field rotation can also 



arise from lensing by gravitational waves (4^ 5^, though probably at an unobservably small 
level. 



Mo re g eneral studies of second-order corrections to the lensing have been performed in Refs. (j& 



61l: l28l). For example there is a Born-correction accounting for the transverse displacement of 
the ray from the undefiected path ^ : this can be included at second order in \E' by performing 
a transverse Taylor expansion of the potential about the undefiected path. This and other 
corrections are generally very small on the scales relevant for the CMB. 



2.6 Lensing in other gravity theories 



Any metric gravitational theory than can be described by the metric of Eq. 1)2.11) would have 
the same predictions for the lensing effect in terms of the Weyl potential that we obtained in 
General Relativity (GR). How the potential relates to the matter fields can of course change 
if Einstein's field equation is changed (for an example in the lensing context see Ref. (j|62l)). 

^ The Born correction usually refers to evaluating the first-order integral expression for the magni- 
fication matrix along the perturbed line of sight. If instead the deflection angle were evaluated along 
the perturbed line of sight, and then the angular derivatives were taken to form the magnification 
matrix, this would automatically include the lens-lens coupling. 
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Other more general theories may give different results. The relativistic version of MOND 
due to Bekenstein (j6^ is a bi-metric theory, however in this case the photons are still null 
geodesies, and many of the predictions for lensing are in fact the same as GR (jc^) (though of 
course the relation to the matter is different). 



3 The lensing potential 

In this section we define the lensing potential, an effective integrated potential useful for cal- 
culating the effect of weak lensing on the CMB anisotropies. We then calculate the power 
spectrum of the lensing potential, and discuss non-linear corrections and small scale approxi- 
mations. In later sections we shall use the lensing potential extensively for computing lensing 
deflection angles and their covariance. 

We recall from Eq. ()1.3p that the deflection angle of a source at conformal distance x* is given 
in terms of the potential ^ by 



The quantity r/o ~" X is the conformal time at which the photon was at position ^n. It is 
convenient to write V_l\E' = n^) / fi-cix) so that 



where Vn represents the angular derivative, equivalent to the covariant derivative on the 
sphere defined by fi. This result was derived rigorously at lowest order in Eq. ()2.13p . We then 
define the lensing potential, 



so that the deflection angle is given by Vfi^. From now on we write this simply as Vip. 
Note that the lensing potential appears to be formally divergent because of the 1/x term near 
X = 0. However this divergence only affects the monopole potential, which does not contribute 
to the deflection angle. We may therefore set the monopole term to zero, and the remaining 
multipoles will be finite, at which point the lensing potential field is well defined. 

For the CMB we can approximate recombination as instantaneous so that the CMB is de- 
scribed by a single source plane at x = X*- We neglect the very small effect of late-time sources, 
including reionization, so a single 2D map of the lensing potential on the sphere contains all 
the required information. For scales on which the potential \E' is Gaussian, the lensing poten- 
tial will be Gaussian. On smaller scales non-linear evolution can introduce non-Gaussianity 
even for Gaussian primordial fields, however on acoustic scales this is a small correction so we 
defer discussion of non-linear evolution to a later section. 








(3.3) 
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For a flat universe fxix) = X? fo^' simplicity we shall assume flatness from now on. The 
geometry only enters into the calculation of the lensing potential (and unlensed CMB) power 
spectra, and the generalization to a non-flat universe is straightforward. Once the lensing 
potential has been computed deflection angles etc. may be derived without further reference 
to the FRW geometry. 

The lensed CMB temperature in a direction fi, T(n), is given by the unlensed temperature 
in the deflected direction, T(n) = T(n') = T(n + a) where o: is a deflection angle. At lowest 
order it is a pure gradient, ct = Vip. 

3. 1 Power spectrum of the lensing potential 

For a Gaussian lensing potential its power spectrum and the cross-correlation with the CMB 
contain all the information we need to describe fully the statistics of the lensed CMB. We 
shall therefore now derive a result for the lensing potential angular power spectrum in terms 
of the power spectrum of the potential. 

We expand 3-D fields into harmonic space using the Fourier convention 

^(x;r^) = /^^v[;(k;r^)e^'^-, (3.4) 
and define the power spectrum for the assumed statistically homogeneous potential by 

(*(k; r7)^*(k'; r/')) = ^^^.(fc; r/, V)5(k - k'), (3.5) 

where rj denotes conformal time. Using Eq. ()3.3p the angular correlation function for the 
lensing potential is then 

{i*(fi)V'(fl')) = 

4 r-dx r-dV {^) f^) /|i3^P,(*;,,,>-e— . (3,6) 

-'o Jo \ X*X ) \ X*X J J (2vr)3 

We now use the result 

e^^-- = AT,Y.^'3i{kx)Y:^{rv)YU^^) (3.7) 

Im 

where ji{r) is a spherical Bessel function given in terms of standard Bessel functions by ji{r) = 
(7r/2r)^/^ Ji+i/2(r). The angular integral over k can then be done using the orthogonality of 
the spherical harmonics and we have 

(^(n)V'(n')) = 16vr ^ T d^ T dx' (^^] (^^) 
iit^'-^o Jo \ X*X J \ X*X' J 

yJ/(A;xh'(^x')^*(^;^,V)n^(n)F;„,(n')5z/'5„„'. (3.8) 
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1 10 100 1000 

Fig. 2. The power spectrum of the deflection angle (given in terms of the lensing potential ip by 
Vip) for a concordance ACDM model. The linear theory spectrum (solid) is compared with the same 
model including non- linear corrections (dashed) from HALOFIT (|6f)l ). 

The lensing potential can be expanded in spherical harmonics as 

^(n) = ^V/m^m(n), (3.9) 

Im 

and for a statistically isotropic field the angular power spectrum Cf is defined by 

ii^lm'iPvm') = Sll'Smm'Cf. (3.10) 

Taking the spherical harmonic components of Eq. ()3.8|) we therefore get 

dk fX* f^* , ,^ n r.-n \-nn(x*-X\(x*- X'\ 



*A / \ A*A / 

(3.11) 



In linear theory we can define a transfer function T^,{k\ri) so that \['(k;?7) = T^i{k\ri)TZ{k) 
where 7?-(k) is the primordial comoving curvature perturbation (or other variable for isocur- 
vature modes). We then have 



2 

(3.12) 

where the primordial power spectrum is Vnik)- Given some primordial power _spectrum this 
can be computed easily numerically using Boltzmann codes such as camb^ ()6fil ). Since it is 
deflection angles that are physically relevant, it is usual to plot the power spectrum of the 
deflection angle Wifj given by /(/ + 1)C/', as shown for a typical model in Fig. |21 Note that for 

^ http: //camb. inf o| 



= 167r / ^Vn{k) r dxT^{k-m-x)ji{kx) 

J K Jo \ x*X 



X 
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10 100 1000 10 100 1000 

I I 

Fig. 3. Cumulative contribution of different redshifts to the power spectrum of the lensing potential 
for a concordance ACDM model. Note we have used a log scale for Cf in the left-hand plot, but 
linear in the right-hand plot. 




10 100 1000 



Fig. 4. Contributions of different wavenumbers k (in Mpc ) to the power spectrum of the lensing 
potential for a concordance ACDM model. 

/ > 1 the Bessel functions go to zero at the origin, ji{kx) — ^ as x 0, so the / > 1 power 
spectrum is finite and well defined. 

The last scattering surface is a long way away, so the lensing potential has contributions out 
to quite high redshift as show in Fig. El Nearby low redshift potentials only contribute to 
the large-scale lensing, so the spectrum is only quite weakly sensitive to late time non-linear 
evolution. The contributions from different wavenumber ranges are shown in Fig. 0] 
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Fig. 5. Correlation of the lensing potential with the CMB temperature. 

The cross-correlation with the temperature can be worked out using an analogous deriva- 
tion (67), and a typical result is shown in Fig. El On very large scales the correlation is 
significant because of the ISW contribution to the large-scale temperature anisotropy caused 
by time varying potentials along the line of sight. However on smaller scales where the ISW 
contribution is much smaller, and the lensing potential becomes almost totally uncorrelated to 
the temperature. The defiection power spectrum peaks at / ~ 60, at which point the correla- 
tion is < 10%. For most applications, for exaraple computing the lensed CMB power spectra, 
this small correlation can be safely neglected ()68[ ). However the significant large-scale correla- 
tion does mean that we already know quite a lot about the large-scale lensing potential from 
observations of the CMB temperature. 

Note the lensing potential has physically relevant modes down to / = 1 (see Section . 



3.2 Limber approximation 



At high I the power spectrum Vii,{k) varies slowly compared to the spherical Bessel functions 
in Eq. ()3.11|) . which pick out the scale k ~ l/x- Using 

I k'dkjiikx)Mkx') = ^5ix - X'), (3.13) 
we can Limber-approximate Cf as 

Cf^^ rxdxV^il/x;Vo-x)(^^^] . (3.14) 
Jo V x*x J 

This approximation is rather good, and is used by the numerical code cmbfast ^ . On small 
scales it is numerically much more efficient than Eq. ()3.12j) due to the reduced dimensionality 
of the required integral. 

^ http: / /cmbfast . org^ 
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The power spectrum of the potential can be related to the power spectrum of the density 
perturbations using the Poisson equation, Eq. ()2.14j) . For a flat universe in matter or dark 
energy domination 



(3.15) 




where Vs{k; t]) is the power in the fractional comoving density perturbation, P{k; r]) is the 
conventionally defined matter power spectrum, H{rj) is the Hubble parameter, and flm{v) is 
the fraction of the energy density in matter. 



3.3 Non-linear evolution 

The most important assumption we have made so far is that the lensing potential is linear 
and Gaussian. On small scales this will not be quite correct due to non-linear evolution. The 
main effect is to change the \l/ power spectrum on small scales, and can be estimated from 
numerical simulations. For simple models, fits to numerical simulation like the halofit code 
of Ref. (|65[ l can be used to compute an approximate non-hnear, equal-time power spectrum 
given an accurate numerical linear power spectrum at some redshift. halofit is expected to 
be accurate at the few percent level for standard ACDM models with power-law primordial 
power spectra. For more general models, for example with an evolving dark energy component, 
further simulations are required (see e.g. (|69l)). Given a method for computing the non-linear 
power spectrum, a good approximation is simply to scale the potential transfer functions 
T<s,{k,ri) of Eq. ()3.12p so that the power spectrum of the potential has the correct non- 
linear form at that redshift: 



Since non-linear effects on C; are only important where the Limber approximation holds, 
Eq. ()3.1(j|l should be very accurate. 

The effect of the non-linear evolution on the power spectrum of the lensing potential is shown 
in Fig. 121 Although there is very little effect on scales where the power peaks {I ~ 60), non- 
linear evolution significantly increases the power on small scales. For future reference, this 
changes the lensed temperature power spectrum Cf by < 0.2% for I ~ 2000, but there are 
percent level changes on smaller scales. The effect on the i?-mode polarization power spectrum 
is more dramatic, giving a > 6% increase in power on all scales. On scales beyond the peak in 
the unlensed 5-mode power (/ > 1000) the extra non-linear power becomes more important, 
producing an order unity change in the 5-mode spectrum on small scales. On these scales the 
assumption of Gaussianity is probably not very good, and the accuracy will also be limited 
by the precision of the non-linear power spectrum. 




inon-linear 



(fc;r/) 



(3.16) 
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4 The lensed CMB temperature power spectrum 



Observations of the CMB temperature power spectrum are becoming increasingly accurate. 
The anisotropics at last scattering are expected to be closely Gaussian, and the power spec- 
trum, Ci, can be worked out accurately in linear theory. However we observe the lensed tem- 
perature field, which will have a modified power spectrum (as well as having a non-Gaussian 
structure). This effect must be taken into account to obtain accurate cosmological parameter 
constraints from CMB observations. 

We first describe a simple derivation of the lensed Ci using a series expansion in the defiection 
angle, which turns out to be a good way to understand the most important effects (|67[l. 
However it is not very accurate because on small scales the lensing deflection will be of the same 
order as the wavelength being deflected, and a perturbative expansion in the deflection angle 
is then not a good approximation. We therefore also discuss a non-perturbative calculation of 
the lensing effect by considering the correlation function Another approach to the 

lensed power spectrum is described in Ref. fflih . 

A calculation of the lensed power spectrum to within cosmic variance must take account of 
sky curvature, however this effect is small so we postpone discussion of curvature corrections 
to Section Non-linear evolution must also be included to get the small-scale spectrum 
accurately, and can be included in the Gaussian approximation by using a lensing potential 
computed using the non-linear power spectrum as described in Section 13.31 



4-1 Simple lowest-order calculation 

As a flrst calculation of the effect on the CMB, we compute the lensing power spectrum at 
lowest order in Cf . To do this we work in harmonic space, using the symmetric flat-sky 2D 
Fourier transform convention for the temperature fleld: 



We shall assume that the temperature fluctuations are statistically isotropic, which means 
that any statistical average cannot depend on where on the sky (or in which orientation) it 
is evaluated. The correlation function of the temperature at two points can therefore only 
depend on the separation between the points 




(4.1) 




(4.2) 
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With this assumption the covariance of the unlensed Fourier components is 



(e(i)e*(i')) = /^e-^'-V-'e(lx-x'i) 



d^x rd^r 



27r J 27r 

5(1' -1) / dVe'^"e(r). (4.3) 



In the second hne we changed variables to r = x — x', and have defined r = |r|. We define the 
power spectrum Cf so that 

Cf = J dhe^^-''^{r) = Jrdr J d0r e^'"™'('^'-'^-^^e(^) = '^t^ J rdr Jo{lr)^{r). (4.4) 
Here we have used the Bessel function Jn{r) that arises from integrating the result 

oo oo 

g.rcos,^^ J2 z"J„(r)e^"* = Jo(r) + 2^z"J„(r)cos(n0). (4.5) 



n=l 



The power spectrum for our statistically isotropic temperature field is therefore diagonal in 1, 
and is given by 

(e(i)e*(i')) = 5(1-1'). (4.6) 

We can perform a similar Fourier transform for the lensing potential ip, and define the corre- 
sponding power spectrum Cf for the fiat sky (equivalent to Eq. ()3.10p on small scales). 

Lensing re-maps the temperature according to 



e(x) = e(x') = e(x + v^) 

^ e(x) + v>(x) v,e(x) + |vxx) vV(x) v„v,e(x) + . . . (4.7) 

As discussed, the series expansion is not a good approximation on all scales. However it can 
be used to get qualitatively correct results for the lensed Ci, and is useful for giving a simple 
derivation to aid understanding. Introducing the Fourier transform of the lensing potential, 
ipO), we have 

V^(x) =^J^ ¥(l)e'■^ Ve(x) =^|^ 10(l)e^'-. (4.8) 

Taking the Fourier transform of 0(x) and substituting we get the Fourier components to 
second order in ip 

9(1)^6(1)- /^l'-(l-l')^(l-l')0(l') 

- ^ / ^ / 1^ + I2 - 1] lrl2e(li)V^(l2)V^*(li + h- 1). (4.9) 
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We now work out the power spectrum of the lensed field 0(x) to lowest order in Cf . We can 
neglect the {QiVjipiV}) correlation since the small scale temperature at last scattering is almost 
uncorrelated with the late-time potentials responsible for the lensing, and the correlation only 
has a tiny effect on the result. The covariance for the lensed temperature remains diagonal 
due to statistical isotropy, so that 

(e(i)e*(i')) = 5(1-1')^®. (4.10) 

At lowest order in Cf there are contributions from the square of the first order term in G, 
and also a cross-term from the zeroth and second order terms. Using ip{\) = we get 

This simplifies to the final result (|67l ) 

Cr ^ (1 - fR^)Cr + / 7^^ [1' ■ (1 - 1')]' Cl.fl (4.12) 



Here we have defined half the total defiection angle power 



R^^l{m') = ^Jjl'Ct, (4.13) 

where for typical models ~ 3 x 10^^, corresponding to an rms defiection of ~ 2.7arcmin. 
The integral term has the form of a convolution of the unlensed temperature spectrum with 
the lensing potential power spectrum. This convolution serves to smooth out the main peaks 
in the unlensed spectrum, which is the main qualitative effect on the temperature spectrum 
on large scales. The effect is shown in Fig. IHl and is several percent at / > 1000. On small 
scales where there is little power in the unlensed CMB the convolution transfers power from 
large scales to small scales to increase the small-scale power. 



4.1.1 Scale-invariant spectrum 

The defiection angle power spectrum l^Cf peaks on large scales / ~ 50 as shown in Fig. |21 
For I ^ 50 the bulk of the convolution term in Eq. ()4.12j) therefore comes from scales with 
/ ~ /'. Consider what happens if the temperature power spectrum is scale invariant on relevant 
scales, /^C® = const. The lensed power spectrum is then given by 
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3000 



Fig. 6. Top: the lensed temperature power spectrum (solid) and the unlensed spectrum (dotted), 
compared to the large I asymptotic result of Eq. H4.16() (dashed). Bottom: the fractional change in 
the power spectrum due to lensing. Both plots are for a typical concordance ACDM model. 



Cf^{l-l''Rf')Cf + l^Cfj 



dh' [\ ■ (1 - 1')]' 



12 



■•^ll-l'l 



;i - fR^)cf + Per / f ^ + I'cr f^^-^{i- my 



(4.14) 



The remaining integral is generally small, and the lensed spectrum only deviates from scale 
invariant at the O{10~^) level. If there were no lensing power at / > Iq, scale invariance would 
be preserved on scales I > Iq: a large-scale lensing mode magnifies and demagnifies small- 
scale structures, which has no effect if the structures are scale invariant. Lensing of the CMB 
is important because the acoustic oscillations and small scale damping give a well defined 
non-scale-invariant structure to the power spectrum. 
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4- 1-2 White spectrum 



Though of hmited relevance for CMB lensing, it is also interesting to consider the lensing 
effect on a white spectrum with Cf = const. In this case we have 



cf « (1 - pii*)cr +crl^, ii. ■ (1 + c* 

J li Zn 

= Cf (1 + 4(^2)), (4.15) 

where we used k = —'S/'^ip/2 to identify the variance of the convergence. Thus a white spectrum 
lenses to a white spectrum scaled by a factor depending on the total power in the lensing 
convergence. 

4-1.3 Small-scale limit 

The fact that I'^R^ ~ (9(1) at / ~ 2500 in Eq. (|4.12p serves as a reminder that the lowest-order 
series-expansion result is not likely to be accurate on all scales. This is because on scales with 
I ~ 2000 the deflection angles are becoming comparable to the wavelength of the unlensed 
fields, so the Taylor expansion is no longer a good approximation. For accurate results at all 
/ we need to perform a calculation that does not rely on a series expansion. In principle this 
could be done by extending the above calculation to all orders ([t^. however in practice it is 
most easily done by considering the correlation function as we discuss in the next section. See 



also Ref. (|7ll ) for an interesting alternative approach. 



Although the series expansion is not accurate for / ~ 2000, on very small scales, / ^ 3000, 
the series expansion result is actually accurate again. This is because the perturbations in the 
unlensed spectrum are nearly wiped out by diffusion damping, so the unlensed spectrum has 
very little power and can be described accurately on small scales by a single gradient term. 
Although the deflection angles are larger than the unlensed wavelengths on these scales, a 
large bulk deflection is unobservable so the effect on the observed small scales is the same as 
if the lensed field could be described by T ^ V?/' • VT. In particular if we set C® = in the 



first term of Eq. ()4.12p and use /' <C / we obtain (|73i |67|) 



C?«/^[l'-(l-l')fC»_,|C? 

«c*/i!!-|i'..fc,e 



(27r 

d^' 

(2vr) 

2 '2 '-'h 



dlo It C,® 



h 4:71 



VCfR''. (4.16) 
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Fig. 7. The functions a^{(3) = Cgi(O) - Cgi(/3) [solid] and Cgi,2(/3) [dashed] as a function of angular 
separation /3 (in radians) for a typical concordance model. For accuracy the results are calculated 
using the full-sky generalization, and use the linear power spectrum for Cf . 

Here we defined 



similarly to Eq. ()4.13|) to measure the power in the unlensed CMB temperature gradient. The 
lensed CMB power is therefore proportional to the unlensed gradient power R®, and to the 
power in the deflection angle I'^Cf. 

This can be understood as follows. Small scale clumps (or voids) in the lensing potential 
disturb the smooth background CMB gradient by the varying deflection angle over the clump, 
giving a clump-scale feature in the observed temperature. The amplitude of this wiggle is 
proportional to both the strength of the background gradient, and the size of the deflection 
angles. The wiggle is on the same scale as the clump, and the lensed power spectrum therefore 
follows the scale dependence of the deflection angle. See Section 111.31 for further discussion 
of the lensing effect from small scale fields, and in particular for the signal from non-linear 
structures such as clusters. 



4-2 Lensed correlation function 

For a more accurate calculation of the lensing effect on all scales we now calculate the lensed 
correlation function in the flat-sky limit, broadly following the method of Ref. |l^ . The 
correlation function is invariant under displacements: it only depends on the separation of the 
two points. Unlike the series expansion approach it is therefore insensitive to bulk unobservable 
shifts of the unlensed CMB. The result only depends on the correlation structure of the 
difference of deflection angles, which is what is directly observable. 




(4.17) 
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Lensing re-maps the temperature according to 



e(x) = e(x + a), (4.18) 

where now to get an accurate result on all scales we do not expand in a = 'Vip- The lensed 
correlation function ^(r) is given by 



e» = (0(x)e(x')) 

= (e(x + a)e(x' + a')) 

As before we have defined r = x — x' and neglected the small correlation between the CMB 
temperature and the lensing potential. The remaining average is over the deflection angle. For 
a Gaussian variate y with zero mean and variance cr^ = (y^) we have 

= e-^y'^/\ (4.20) 



Since we are assuming a is a Gaussian field, I ■ {a — a') is a Gaussian variate, and the 
expectation value in Eq. ()4.19j) therefore reduces to 



exp(-i([I.(«-«')]^)). (4.21) 



2 

To compute the average in the exponential we need to work out the correlation tensor 

(a.a;.) = (V.^(x)V,^(x')) = / (^^W.Cf e^''^ (4.22) 

By symmetry, the correlator can only depend on 6ij and the trace-free tensor r(ifj) = rirj — ^Sij 
where r = r/r. We evaluate these parts by taking the trace to give 



Cg,(r) ^ (a ■ a') 

J 27r ' 



l-JdH'CfMlr), (4.23) 
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and contracting with r^jf^) to give 



Cgi,2(r) = -2f(,f,)(aV^') 



(27r)2' 



(27r)2 2 



-Cfe 



(4.24) 



Here we defined as the angle between 1 and r, and used Eq. ()4.5p to express the integrals in 
terms of Bessel functions. Using these results we have 



{aia'j) = -Cgi{r)6ij - Cgi,2(r)f(ifj). 
We can now compute the required expectation value in Eq. ()4.21|) : 



(4.25) 



[1 ■ (a - cx')Y) = PP{{ai - a'^ia, - a')) 



:/2[C'gi(0)-C'gi(r)] + 2rPf^,f,)C'gi,2(r) 
:/2[a2(r) + cos20Cgi,2(r)]. 



(4.26) 



We have defined the quantity a^{r) = Cgi(O) — Cgi(r) = ^{{a — a.'y), which is half the variance 
of the relative deflection of the two points. 

Putting this all together we get the final result for the lensed correlation function 



iir) 



d^l 



QV-exp(-^([I.(«-«0]^) 



(2vr) 
d^l 

(27r)2^' 
dl PCf 
T 27r 



-C0e^'"™^<^exp ( — P[(r\r) + cos20Cgi,2(r)] 



^e^ilrcos<l,^-l^aHr)/2 ^ _ _ ^qs 20 Qi 2 (r) + . . . 



Jo(/r) + -/2Cgi,2(r)J2(/r) + 



(4.27) 



Here for simplicity we expanded in Cgi^2 before integrating over 0. Since Cgi^2 is significantly 
less than o"^ (as shown in Fig. Ej), the perturbative expansion in Cgi,2 converges much faster 
than one in and this is a good approximation for the scales of most interest. Note that it 
is important to keep exp(— /2cr2/2) as the main non-perturbative term in Cf. 

The lensed correlation function is related to the lensed Ci by Eq. ()4.4p . The calculation of 
the correlation function and the inversion to the power spectrum is numerically very efficient, 
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Fig. 8. Fractional difference between the lensed Ci results using the series-expansion method to 
lowest order in Cf compared to the more accurate correlation function result (temperature (solid) 
and E'-polarization (dashed; see Section [21)). 



requiring only one dimensional sums and integrals. It is used in the numerical codes CMB- 
FAST lli), CMBEASY (j?!) and CAMB (jgl). Results to the next order in Cgi,2 are given in ([Toil. 
along with the generalization to a non-fiat sky. This is sufficient for a calculation to within 
cosmic variance on scales where the non-linear power spectrum can be computed accurately 
and non-Gaussian corrections can be neglected. The full result is implemented in the public 
CAMB code. 



The difference between the result using the accurate correlation-function method and the 
lowest-order series-expansion result is shown in Fig. |H| The series expansion result is accurate 
on large scales where the deflection angle is much smaller than the wavelength of the modes, 
and also on very small scales where the unlensed CMB has little power and can be accurately 
approximated by a single gradient term. In the intermediate regime the series-expansion result 
is not accurate and the non-perturbative correlation function method must be used. Note that 
the errors are percent level by / ~ 2000, amounting to a significant fraction of the lensing effect 
on these scales. 



4-2.1 Exact result 



Although the series expansion we used in Eq. ()4.27|) is fully adequate for an accurate fiat-sky 
calculation in most cases relevant for the CMB, for completeness we note that the integral in 
Eq. ()4.27|1 can actually also be evaluated exactly as an infinite sum. Substituting r —>■ ir in 
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Eq. ()4.5|) and using Jniir) = i^Inij) where /„ is a modified Bessel function we liave 

oo oo 

g-.cos0^ ^ (-ir4(r)e*"<^ = /o(r) + 2^(-ir/„(r)cos(n0). (4.28) 



n=l 



Using tliis and Eq. (j4.5p to expand tlie exponentials in Eq. (j4.27|) . and integrating over 0, leads 
to the exact flat-sky weak lensing result for uncorrelated Gaussian fields with the temperature 
on a single source plane 

T ^e-'^-^M/2 E In [l'C,,,,{r)/2] J,^{lr). (4.29) 

Note that for \n\ > there are two equal contributions from n = \n\ and n = —\n\. Also 
note that although In{r) — * e''/v27rr for large r, because (T^(r) > Cgi_2(?") the combination 
e~' ^^^^'^ In{l'^C gi^2{r) / 2) goes to zero as / oo. For a fixed maximum /, only a low number of 
terms in |n| near zero are required to achieve good accuracy. This general form may be useful 
for calculating the lensing effect on very small scales / ^ 10^ when C® has small scale power 
and a low order expansion in Cgi^2 is insufficient. One non-CMB example where this may be 
useful is calculating the lensing effect on the 21-cm power spectrum 1711). 



4-2.2 Conservation of total variance 

The variance at a point ^(0) = (|6(x)p) can be obtained by noting that Cgi,2(0) = cr^(O) = 
and /„(0) = 6no- It follows from Eq. ()4.29j) that the total power is conserved by lensing 

This reflects the fact that weak lensing only alters photon directions, and hence the spatial 
correlation structure, but does not change the variance in any given direction. We are observing 
the same photons, just in a slightly different direction. 



4-2.3 Relation between lensed and unlensed correlation functions 

We can also calculate the lensed correlation function in terms of the unlensed one using 
Eqs. p:?f|l and 

l{r) = J^J dVe-^'-'e(rOe^'-^exp ^-h'[a\r) + cos2{^, - C,,,2ir)] 

dV^(r') / jfLe*Hr-r')g-/V2{r)/2g-a2,-/22)Cg,,2{r)/2 



(27r) 

dVe(r') / (r - vT r;W(^^>^)C^\ 
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I 

Fig. 9. Contributions to the total blackbody-spectrum temperature anisotropy Cf (thick sohd) from 
CMB lensing and kinetic SZ (kSZ) plus Ostriker-Vishniac (OV) for a typical ACDM model with 
inhomogeneous reionization. The kSZ+OV signal is taken from a model in Ref. (76), and can vary 
depending on parameters. The spectrally-distinguishable thermal SZ signal is shown dashed, though 
this also depends strongly on parameters. On very small scales kSZ+OV dominates both the unlensed 
(dotted) and lensed CMB power spectrum. 

In the middle line, we have resolved 1 into its components Iri and lr2 parallel and perpendicular 
to r. In an isotropic approximation where Cgi^2('") = 0, this is an isotropic Gaussian convolution 
of the two-dimensional correlation function, with a smoothing width cr^(r) that depends on r. 
This smoothing of features in the correlation function corresponds in harmonic space to the 
smoothing of the C® peaks. The angular integral can also be done to express the answer in 
terms of modified Bessel functions, but the result it not especially enlightening or useful. 



4-3 Comparison with other CMB temperature secondaries 

In addition to weak lensing, there are a variety of other non-linear effects on the CMB. The 
thermal Sunyaev-Zel'dovich (j2o[ l2lh (SZ) effect due to scattering from hot electrons can be 
significant on small scales. This signal has a frequency spectrum different to that of the 
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linear CMB and hence can in principle easily be distinguished. However there are other non- 
linear effects which, like CMB lensing, have the same frequency spectrum as the primordial 
anisotropics. These are a potential source of complication when analysing the lensed CMB as 
they cannot be easily distinguished, and may indeed be correlated with the lensing signal. 

The kinetic SZ effect is the main such effect for the temperature anisotropics ([ttI : Ffii Fsh . 
arising from scattering from electrons in a bulk flow of a non-linear perturbation with respect 
to the CMB. It is closely related to the second-order Ostriker-Vishniac effect fl79l) . they are 
both essentially Doppler terms determined by the electron velocity projected along the line of 
sight direction. These effects combine with additional signals due to patchy reionization (jsol ) to 
give a significant non-linear anisotropy from around the epoch of reionization and afterwards. 
Current uncertainties in the reionization history and morphology make detailed predictions 
for the spectrum difficult, and at the moment this uncertainty gives rise to a percent-level 
uncertainty in the expected total Cf at / = 2000 ((t^. Fig. |^ shows a sample comparison. 
Precision cosmology from the temperature damping tail will require accurate modelling of 
both lensing and the kinetic SZ effect, and they may indeed be significantly correlated ()77l ). 



For the polarization spectra the kinetic SZ effect is much smaller ()81l:l76fl. and lensing is likely 
to be the dominant effect. 

Other non-linear signals include the Rees-Sciama effect, a late time ISW-like anisotropy gen- 
erated by photons propagating though potential wells that are evolving because of non-linear 
growth or their bulk motion (js^)- This is generally sub dominant on all scales (jssh. and hence 
is not a major concern. There is also a second-order time-delay effect due to the varying 
potentials along the line of sight. However the effect turns out to be much weaker than the 



lensing contributions, and can probably be neglected (jSJ; |4 



5 Lensing of CMB polarization 



The presence of a non-zero photon quadrupole at last scattering means that there will be an 
interesting polarization signal. As for the temperature, the observed polarization field will have 
been lensed by potential gradients along the line of sight, and we need to model this effect. 
The polarization orientation along a lensed light beam is simply transported in the manner 
described in Section 12.41 At lowest order this means that the orientation of the polarization 
in the observed direction n and the lensed direction fi' will be the same with respect to the 
geodesic connecting the two points. The effect of lensing may therefore be modelled in a 
manner similar to the temperature lensing, with the added complication that the polarization 
is a tensor field that is parallel-transported along deflection vectors. 

We start with a brief review of the properties of the polarization field, and explain the decom- 
position into physically distinct E and B modes. We then calculate the effect of lensing in the 
flat sky approximation, both using the approximate series expansion and correlation function 
methods. We conclude this section with a discussion of the important impact of CMB lensing 
on the ability to observe primordial gravitational waves. 
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5. 1 Stokes ' Parameters 



A CMB photon detector measures an electric field E perpendicular to the direction of obser- 
vation n. Prom the observation we define the intensity matrix 

Vab = C{EaE;) (5.1) 

for some constant C that we will choose so that Vab is measured in temperature units. The 
angle brackets here denote a time average. The rank-two correlation tensor can be decomposed 
into irreducible parts 

Vab^Pab+^Sabl + Vlab], (5.2) 

where Pab = Vi^ab) is the symmetric trace- free part, the trace / is the total power intensity, 
and Vjab] = ^V€ab is the antisymmetric part. The antisymmetric part will only be non-zero if 
there is a phase lag between Ei and E2, and therefore describes circular polarization. Since 
the Thomson scattering responsible for the CMB signal does not induce circular polarization 
we shall neglect the V Stokes' parameter here. The intensity we have already discussed in the 
section on the lensing of the temperature field. Here we discuss lensing of the polarization 
described by Pab- 

It is often useful to describe the polarization with respect to a fixed basis in the plane orthog- 
onal to the direction of observation. For some choice of orthonormal directions ei and 62 the 
components of the trace-free symmetric polarization tensor can be written 

(5.3) 

This defines the two Stokes' parameters Q and U, which depend implicitly on the basis in 
which they are evaluated (physically, on the orientation of the polarizer when the observation 
is made). The Q parameter can be thought of as the difference in power measured when the 
polarizer is aligned with ei compared to when it is aligned with 62. The U Stokes' parameter 
measures the difference in power between the 45° directions ei ± 62. So Q and U change sign 
under a rotation by 90°, and into each other under a rotation by 45°. 

The eigenvalues of 2Pij are ±((5^ + t/^)^/^. The eigenvectors make an angle | tan~^(t//Q) to ei, 
so it is sometimes convenient to think of the polarization field as a field of headless vectors of 
length (Q^-l-C/^)^/^ at an angle ^ta,-a~^{U/Q) to the ei vectors. The headless vectors represent 
the basis-independent physical orientation in which a polarizer would give maximum signal. 

It is also useful to consider the components of Pab with respect to the complex vectors e± — 
ei ± ie2 

P^ele^Pab^Q + iU 

P*^e'ie'LPab^Q-iU. (5.4) 
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So instead of using Pab, we can if we wish describe the polarization in terms of a complex field 
given by the components with respect to a particular basis e±. 

On the flat sky we use the basis e± = ± iey. Consider what happens if we rotate our basis 
by an angle 7 clockwise (e.g. rotating e^; in the direction away from e^^). We can write this 
rotation in a convenient complex form as 

e± = ex± iey ^ ± ie'y 

— (cos 7 e-r — sin 7 Bj^) ± i (sin 7 e^; + cos 7 e^^) 

= e±*^(e^ ± iey) = e±^^e±. (5.5) 
It follows that in the rotated basis 

P' - e^'eVPab = e^'^P (5.6) 

Under the transformation e+ — > e^'^e^ a quantity sV is called spin-s if 

,77 ^ sVe''''. (5.7) 

So the P^^'y^ we defined above has spin 2, where the {x,y) superscript denotes that Q and U 
are measured with respect to the e^ and basis. 

On the sphere, the natural basis consists of the and e,^ vectors of the spherical-polar 
coordinate system. Looking from inside the sphere, 6$ points along great circles from the 
north to south pole, and e^ points to the left orthogonal to e^. There is therefore potential for 
sign confusion because {eg, e^) form a right-handed set (viewed from inside), whereas (e^;, e^,) 
define a left-handed set, so we must be careful with how we define things. The rotation 

eg + ie^ e^^{eg + ie^) (5.8) 

corresponds to an anticlockwise rotation about the outward normal r. If we measure Q and U 
with respect to the (e^, e^,,) basis P^^'"^) will be spin 2 as before. However if we define Q and U 
with respect to the left-handed set analogous to {ex,ey) using (e^, — e<^), the complex p(^'~'?^) 
then has spin -2. Both conventions have been used in the literature. On the sphere we shall 
define Q and U with respect to and e<^, so P is spin 2. We note, however, that this differs 
from the astronomical (lAU) standard which defines Q and U on — and e^. 

5.2 E and B polarization 

We can write a polarization tensor field Pab in terms of two scalar fields Pe and Pb as 

Pab = V(„V6)Pe + e^(„Ve) V^Pb, (5.9) 
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where Pe describes the 'E'-mode' gradient-hke polarization, and Pb described the 'S-mode' 
curl-like polarization. The angle brackets around the indices denote the symmetric trace-free 
part, and Vq is a covariant derivative in the surface on which we measure Pab (i-e. orthogonal 
to the observation direction fi). If the fields are defined over a surface without boundary (as 
the polarization is on the full sphere), this decomposition is unique because the derivatives are 
invertible. What happens on a surface with boundaries (for example observations over only 
part of the sky) is more complicated, and discussed in detail in Refs. (jist 



Locally pure E and B quantities can be extracted by taking two derivatives to form the 
pure E quantity V^"V^^Pa6 and pure B quantity ec'-'^V'^-'V^Pafe- However since integration of 
these terms to recover the underlying Pe and Pb is inherently non-local, it is usually more 
convenient to work directly in harmonic space. The scalar fields Pe and Pb can be expanded 
in terms of scalar harmonics Qk, giving 

Pab = —f^ X! \P'kQ^ah + BkQ^ ab , (5.10) 

where we have defined gradient and curl tensor harmonics (jsih 

Qkab = NkV{a'^b)Qk Qkab = NkC^a"^ b)"^ cQ k (5.11) 

and Nk is some normalization. The E^ and B^ are the harmonic components of the gradient- 
type and curl- type terms, and the factor of l/\/2 is inserted for later consistency. 

We can also expand P, the complex spin 2 component, in harmonics as follows. The alternating 
tensor can be written as in terms of e± as 

e-b = -ie^^e1. (5.12) 

Using the fact that el = and e_|_ ■ e_ = 2, we then have 



P = e'! P 



^a _6 



V2 t 



EkSlelVaVbQk + -e_aelele\VbVcQk 



T.^iEk + iBk)e\e\VaVbQk 
k v2 

Y,{Ek + iBk)2Qk. 



(5.13) 



Note that some authors define harmonic coefficients so they differ by a sign from those here ()8£ 
changing the sign of the temperature E'-polarization cross-correlation power spectrum. The sign here 
is different to the output of CAMB and CMBFAST. 
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Here we have defined the spin- 2 harmonics, 2Qfe, given by 



^Qk = -^ele'.Q^ab = ^e^e^VaV^g^. (5.14) 

Why don't we just expand the complex field spin-2 field P directly in terms of scalar har- 
monics? We certainly could do this, but the harmonic coefficients would then be spin 2, and 
therefore still depend on the choice of local basis orientations. Instead, we want to expand 
in spin-2 harmonics, so that the harmonic coefficients and are related to physically 
relevant basis-independent scalar quantities. 

The complex conjugate P* = Q — iU has spin -2, and can be expanded analogously in terms 
of spin —2 harmonics _2<5fe giving 

P* ^^{Ek-iBk)-2Qk (5.15) 

k 

where 

_2Qk = -^e''_e'LWa^bQk. (5.16) 



5.2.1 Flat-sky harmonics 

On the flat sky we use harmonics Qk — e*''^, so the tensor harmonics are 

QS,(l) = -x/2y,)e^'- (5.17) 
gS.(l) = -V2eV^W'" (5-18) 

where we used the normalization factor Ni = V^/l"^ required so that the tensor harmonics are 
normalized. The spin-2 harmonics are therefore given by 

2g(l) = r2e«e^V„V5e^'-^ 

= -(cos 01 + i sin 0i)^e*^''' 

We can therefore expand the polarization field as 



P^Q + iU^- [{E{1) + zB{\)] e^^'^'e^'"' (5.20) 

P*^Q-iU ^ - [{E{1) - iB{l)] e-'^-^'e^'-". (5.21) 

J ATT 
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The e^2i0i factors serve to rotate between a natural basis defined in terms of 1, and tlie fixed 
basis defined in terms of e^. The inverse relations are 



E{l)+iBil) 



dSc 

271 

d^x 



Pg-2i<^lg-il-x 



*g2i</>ig-il-x 



(5.22) 



5.2.2 Full-sky harmonics 



On the full sky we can expand scalar fields in terms of scalar harmonics Qk = Yim, and 
therefore define spin 2 harmonics 



2Yim = ^eletVaVbYim 



V2 



(5.23) 



and similarly for the spin -2 case. We cannot give a neat closed form expression for the 2^«m; 
but they can be calculated recursively as can the spherical harmonics themselves (jssl). The 
normalization may be shown to be 



N, 



2(/-2)! 



(5.24) 



5.3 Lensed polarization power spectra 



5.3.1 Lowest-order calculation 

We start by doing a simple first-order calculation, analogous to the one we did for the temper- 
ature field in Section I^^Tl The lensed polarization tensor Pab can be expanded in the deflection 
angle as 

P„6(x)=P„fe(x+ VV') 

^ P,fe(x) + V^V^VePafe(x) + ]^V'^ljV^^PVcVdPah{^) + • • • • (5.25) 

As in the temperature case, we cannot expect a second-order truncation to be very accurate 
on all scales, however it is useful for a simple analytical analysis that correctly describes the 
main aspects of polarization lensing. Because the e± vectors are independent of position on 
the fiat sky, they commute with derivatives, so the same expression holds if Pab is replaced 
with its spin-2 components P = e'^e^Pab- 

Scalar mode perturbations produce no P-mode polarization at linear order, in which case 
B{\) = 0. The lensing effect of tensor modes is expected to be small, so we are mostly 
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interested in the S-modes generated from lensing a purely i?-mode unlensed field. To make 
our life a little simpler, we shall therefore set 5(1) = 0. Performing a harmonic expansion we 
then have the lensed harmonic coefficients 

E(l) ± iB{\) ^ E(l) - / ^ 1' ■ (1 - l')e±2i(0,,-<^O^(i _ \')E{1') 

-I /^e±2^(<^"-«li-[li + l2-l]lrl2i5;(li)V(l2)V^*(li + l2-l). (5.26) 

For a statistically isotropic field the unlensed power spectra are given by 

{E{\)E*{\')) = 5(1 - \')Cf {B{\)B*{r)) = (5(1 - l')Cf {E{1)&*{\')) = 5(1 - \')C{' , 

(5.27) 

where we have assumed that statistical parity invariance holds, so that B in uncorrelated 
with B or E. Here we are taking = 0, but will calculate the corresponding non-zero lensed 
S-mode power spectrum. Keeping terms to order Cf we therefore get 
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d^' 



(5.28) 



d^r 

(27r)2 

■ d^r 

(2vr)2 



l'-l)2C,^' (5.29) 



l'-l)2Qf. (5.30) 



The integrals are actually real because the non-complex terms are symmetric as 1' is rotated 
in either direction from 1, so we can replace the exponential with a cos-term. This gives the 
final result for the lensed spectra to lowest order in (67) 



Cf = {\-l^R^)Cf^ /U[l'-(l-r)f Cf_,|qf|cos2 2(0i,-0i) (5.31) 



21/ 



C'f = y ^ [1' ■ (1 - 1')]' qti'iqf I sin^ 2(0, - 0i) (5.32) 
Cf = {\-l^Ei>)Cf^ /^[r.(l-r)fqti'|qf I cos 2(0,-0.). (5.33) 

Here we have used the definition of from Eq. ()4.13|) as in the temperature case. The 
more general result allowing for unlensed S-modes can be obtained as a straightforward 



generalization, and is given in Ref. (jfiTI). We will also include the 5-terms in the next section 



when we do a more accurate calculation in terms of the correlation function. 

Since the tensor modes would only have a significant signal on quite large scales it is interesting 
to consider how large the 5-mode lensing signal should be there. If we take |1| <^ |1'| we have 
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Fig. 10. The unlensed E-power spectrum l{l + l)Cf" /2it (top, dashed), lensing potential power spec- 
trum [/(/ + 1)]^C/' (dot-dashed), and the contribution to the large-scale lensed S-mode power spec- 
trum from each log / given by half their product (thin solid) . The dotted line is the large-scale white 
spectrum /(/ -|- 1)CP /2tt given by Eq. (|5.34j) . which can be compared to the full numerical result 
(thick solid). The lensed B-mode power spectrum has a close-to- white spectrum at I <^ 1000. 

= ^ / y l"Ct V'Cl (5.34) 

independent of /, corresponding to a white-noise spectrum for Cj^ on large scales. This is 
actually a rather good approximation because there is little ii^-mode power on large scales, so 
the spectrum is indeed close to white on scales / <C 1000, as shown in Fig. ^| For standard 
constant spectral index ACDM models allowed by the current data Cj^ ~ 2 x 10~^yuK^. 

The scales responsible for the large-scale 5-mode signal cover a wide range in /, as shown in 
Figs Uni and HU To see why this should be so, consider a i?-pattern at last scattering with some 
fixed spatial frequency kE- If this is then lensed by an aligned lensing potential mode with 
~ kE (perpendicular to the line of sight), the mode coupling produces power on scales 
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Fig. 11. The contributions to the low-/ (/ ~ 10) power spectrum from lensing potential with 
I < ^max- A significant fraction of the power comes from the tail of Cf at high /. 

down to \k^ ~ kE\- This is the power that will be observed in the 5-mode on large scales 
(and also in the £'-mode if it weren't dominated by the much larger unlensed signal from 
reionization and last scattering). There are contributions to the large scale B modes from all 
scales on which there is non-zero E and lensing potential power. Since the relevant E and ip 
modes are dominant on small scales (the last-scattering E is small on large scales), they have 
a short correlation length. It follows that each contribution to the lensed B from different 
coincidentally-aligned patches of E and ip will be nearly spatially uncorrelated. Uncorrelated 
random fields in real space correspond to a white spectrum in harmonic space, hence the 
prediction for a very nearly white i?-mode spectrum on large scales where there is little E- 
mode power (/ < 1000). 

As we have shown a purely scalar E'-mode polarization signal at last scattering will give rise 
to non-zero S-modes after lensing by the potential along the line of sight. This has important 
consequences for the detectability of primordial tensor modes via their B-mode signal, as 
discussed in the next section. 

The effect of lensing on the ii^-mode power spectrum is more like the effect on the temperature: 
it is essentially a convolution with the lensing potential, which smooths out the acoustic peaks. 
Because the unlensed polarization peaks are rather sharper than in the temperature case, this 
actually means that the effect of lensing is quantitatively more important on the spectrum. 
The fractional change in the spectrum is shown in Fig. and is clearly seen by eye in Fig. ^| 
Lensing must therefore be modelled in order to get accurate results when doing parameter 
estimation from polarized power spectra. On very small scales the unlensed polarization is 
damped and there is very little unlensed power. We can therefore apply a similar argument to 
Eq. ()4.16|) that we used for the temperature spectrum on small scales: the unlensed polarization 
field can be accurately described by a single gradient term. Using /' <^ /, we get the / ^ 3000 
approximation 
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Fig. 12. The unlensed T-E correlation and E-mode power spectra compared to the smoother lensed 
results. 

Cf ^Cf [V . f C|f I cos^ 2(0, - 00 



i2ny 



(5.35) 



for the i?-polarization and 



1 



21/ 



(2vr)2 



[V ■ 1]' I sin2 2(0, 



(5.36) 



for the i?-polarization. So on small scales Cf" ~ CP, and the power is proportional to the 
deflection angle power at that scale, as shown in Fig. UHl Here 



R^'^^Jj I'CP^ = (I Vgr) = (I Vt/p) ~ 2 X lOVK^ 



(5.37) 



measures the total power in the gradient of the unlensed polarization field and is defined 
analogously to Eq. ()4.13|) when there are no unlensed S-modes. The corresponding rms un- 
lensed gradient is (IVQP)^/^ = (iVf/p)^/^ ~ 1.3 /xK arcmin'^ The fact that E and B have 
equal power on small scales is a special case of more general results for lensing of polarization 
gradients by small lenses (see Section lll.3.1|) . 
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Fig. 13. Top: the lensed E (top solid) and lensed B (bottom solid) power spectra, compared to 
the unlensed E spectrum (dotted) and the asymptotic result of Eq. H5.35() (dashed). Bottom: the 
fractional change in the E power spectrum due to lensing. All results are for a fiducial standard 
ACDM cosmology. 

5.3.2 Lensed polarization correlation functions 

As in the case of the temperature spectrum, the series expansion in the deflection angle that we 
have used in the previous section is not expected to be very accurate on small scales, so for a 
more accurate calculation of the lensed power spectra we need a non-perturbative calculation, 
most easily performed via the correlation function. The calculation is rather similar to the 
one for the temperature we did in Section 14.21 so we shall not labour the similar parts of the 
derivations here. The calculation was first done in Ref. (|9ol). though here we include new fully 
non-perturbative results as well as lowest terms in the series expansion. 

We shall work from the spin-2 polarization field P. The scalar correlation function between 
polarization at x and x' should be independent of the basis used to define P at the two points. 
To do this, we want to describe the polarization in the physically relevant basis defined by 
r = x — x'. If r makes an angle (pr to the e^. axis, this amounts to rotating the basis by an 
angle (pr anticlockwise at each point, giving Pr(x) = e~^*''^''P(x). In this physical basis we can 
then define the basis-independent correlation functions 
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Ur) = (P;(x)P.(x')) = (P*(x)P(x')) (5.38) 
Ur) = (Pr(x) P.(x')) = (e-^^<^^P(x)P(x')) (5.39) 
ex(r) = (P.(x) 0(x')) = (e-2^^^P(x)e(x')). (5.40) 

In tensor notation the first is proportional to (Pafe(x)P"*(x')) and the second to the spin-4 
component of (Pafc(x)Pcrf(x'), however on the sphere one has to be careful to define what 
one means by contracting tensors at different points , so we shall stick with the complex 
representation that generalizes more obviously to a spherical calculation. We note that ^+(r) 
is real because of statistical rotational invariance. In terms of the Stokes' parameter defined 
in the physical basis by Pr = Qr + iUr the correlation functions are 



e+(r) = (g.(x)g,(x') + Ur{^)Ur{^)) (5.41) 

e-(r) = (g.(x)g,,(x') - f/,(x)f/,(x')) + «(g.(x)f/,(x') + f/,(x)g,(x')) (5.42) 

U{r) = (g,(x)0(x')) + ^([/,(x)0(x')). (5.43) 

If we assume that statistical parity invariance also holds, the correlators must be invariant 
under reflection of the fields. Reflecting in r, the temperature and Qr Stokes' parameter remain 
the same, but Ur changes sign, so that (gr(x)[/r(x') + f/r-(x)gr(x')) and (?7r(x)0(x')) would 
change sign. For a statistically parity-invariant ensemble we therefore expect the imaginary 
parts to vanish, and we shall neglect them from now on. 

The calculation for the lensing correlation function ^+(r) is exactly analogous to that for the 
temperature in Section W?]\ with Cf replaced by Cf + Cj^ , 



e+(r) = (P*(x + «)P(x' + a')) 

= ldliCf + CP)e-^'^'/' Y: /n(/'Cgi,2/2)J2„(/r) 

= ^ / W/ (Cf + CP) e-'^'^^M/^ [jo(/r) + h'C,,,2ir)Mlr) 



(5.44) 



The calculation for ^-(r) is however a little different because of the rotation factor e we 
have 



The required operation is to parallel-transport one of the tensors along the geodesic connecting 
the two points. 
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|_(r) = (e-^^'^'-P(x + a)P(x' + a.')) 

1 



(27r)2 
(27r)2 



(Cf - )e*' "cos40 exp ~-f[a\r) + cos20Cgi,2(r) 



/(^-E /^B\ ilr cos (f> —Pa'^( r)/ 2 



d(/2Cgl,2/2)^ 



exp [ — -/^cos20Cgi^2(^) 



- / m (Cf - C,«)e-'^'^^M/2 ^ [2/:(/^Cgi,2/2) - 4(/^Cg,,2/2)] J2n(/r 
i- Jldl {Cf - )e-'^'^^M/^ (j4(/r) + \l^C,,,,{r)[Ulr) + J6(/r)] + ...), (5.45) 



where as before we defined (f) = — (p,,. and dashes denote derivatives with respect to the 
argument. The cross-correlation derivation is similar, giving 



^^(r) = (e-^^<^'-e(x + a)P(x' + a')) 



^C,-^e^i-^cos20 exp (-^/^[^^(r) + cos 20 ^^.^(r)] 
I /d/Cf e-'^'^'M/^ (j2(/r) + ^/^Cgi,2(r)[Jo(/r) + Ulr)] 



(5.46) 



The series expansion to next order in Cgi,2 and the spherical gen eralizations needed for a 
calculation accurate to within cosmic variance are given in Ref. (17(1 ) (see also Section IH)). 

The relation between the correlation functions and the power spectra follow from a similar 
argument to Eq. ()4.3|1 . or using the orthogonality of the Bessel functions combined with the 
unlensed limit of Eqs. dOH), and 



Cf + Cf = 271 j rdr Jo(^r)^+(r) (5.47) 
Cf - = 27: J rdr JSr)i-{r) (5.48) 
C/^ = 27r / rdrJ2{lr)ix{r). (5.49) 



The correlation function method is significantly more accurate than the lowest-order series- 
expansion result at 1000 < / < 6000 as shown in Fig. and must be used for an accurate 
calculation of the lensed power spectra. 
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Fig. 14. The 95%-confidence regions for the CMB polarization power spectra given a 2005 compilation 
of CMB and large scale structure data from Ref. (|9ll ) . The parameters varied are those for a constant 
spectral index ACDM model with insignificant neutrino mass and sharp reionization. The two tensor 
results are the B-mode power spectra expected from for a scale- invariant tensor-mode power spectrum 
with At = 4r X 10~^ and two possible values of r. The effect of tensors on the cosmological parameter 
constraints was neglected, and a prior z^e > 6 was assumed. 

5.4 B-mode confusion: implications for constraining inflation 



Much future work on the CMB will be directed towards the detection of any primordial 
gravitational waves (tensor modes). This is because tensor modes produce a distinct S-mode 
polarization signal that can serve as a 'smo king gun' for primordial gravitational waves if it 
can be distinguished from the lensing signal (|88l:l92ll. Any detection of tensor modes would be 
of great interest as a way of discriminating different classes of models of inflation and other 
models of the early universe ^ Iq^: loih. 

The primordial tensor modes are described by the transverse traceless part of the metric 
tensor, hij, which is expected to be Gaussian in most models. The power spectrum Ph{k) 
of the super-horizon modes after inflation (or more generally, near the start of the radiation 
dominated era) is defined by 




(5.50) 
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It is usually parameterized as 

P4k) = ATij^] (5.51) 

for some choice of pivot scale k^, where most inflationary models predict that rix is close to 
being constant. In the context of slow- roll inflation, values of r consistent with current upper 
limits imply the tensor spectral index tit ~ 0. For simplicity we shall assume ut = 0, which 
is likely to be a good approximation over the relevant range of scales. The pivot-point power 
At is used to measure the amplitude of the primordial tensor modes. For standard single-field 
slow-roll models of inflation, the tensor amplitude is related approximately to the amplitude 
of the inflaton potential when the mode left the horizon by (jQ Q) 

OA 

K ^ ^m^i ^ At[4.8 X 10^^GeV]^ (5.52) 
128 

where mpi is the Planck mass. For standard ACDM models a tensor amplitude of At ~ 4x 10~^ 
would contribute about the same power as the scalar modes to the large scale CMB anisotropy. 
If we define a 'scalar/tensor' ratio 

r = A/(4 X 10-^) ~ K/[3.8 X 10^^GeV]^ (5.53) 

it will be approximately the ratio of the tensor to scalar contributions to the large-scale CMB 
temperature power. There are many different definitions of r in the literature , so for clarity 
we have defined it here in terms of the unambiguous tensor mode power At- The smallest 
tensor amplitude clearly detectable within the cosmic variance of the CMB temperature signal 
is around At ~ 4 x 10~^'^ (r ~ 0.1), so the CMB polarization B-mode 'smoking gun' (|92i ) is 
needed to do much better than this. 



The problem is that we have no idea what the tensor amplitude is, and it could be small. As 
we showed in Eq. fl5.H4|l the lensing of purely scalar E-mode polarization will produce a nearly 
white lensed B-mode spectrum CP ~ 2 x 10~^/iK^ (corresponding to a standard deviation 
averaged in a pixel of 1 arcmin of y/C^ ~ 5/iK-arcmin). If the B-modes from the tensor modes 
become comparably small, the lensing signal becomes an important source of confusion. For 
the Planck satellite the experimental noise gives A*";^ ~ 4 x 10~^yuK^ iV-^F ~ 70/iK-arcmin), 
which is large enough that the lensing 'noise' is only a very small correction. However for 
future more sensitive observations, or those coving a smaller area of sky, the lensing signal may 
become a limiting factor. The range of allowed by current data is shown in Fig. along 
with various possible tensor B-mode spectra with different amplitudes. The large scale white 
noise power expected from lensing is currently constrained to be CP = (2.0 ± 0.2) x 10~^/xK^ 
in standard ACDM models. 

Treating the lensing signal as a Gaussian noise, for a nearly full-sky observation with no 
experimental noise the lowest tensor amplitude that can be distinguished is about At ~ 

Common definitions for r are the ratio of the CMB temperature quadrupole powers, or a ratio of 
some scalar primordial power to the tensor power at a given scale. The definitions differ by order 
unity factors that depend on the cosmological parameters or choice of scalar variable. 
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(1-4) X 10~^^ (depending on the reionization redshift), corresponding to r ~ (0.2-1) x 10~^ 
and an energy scale of inflation ~ (5-7) x lO^^GeV (97; IssI ). As observations approach the 
lensing hmit, they need to survey more and more of the sky in order to beat down the lensing 
variance: the presence of the lensing signal therefore pushes observational strategies towards 
favouring large sky coverage (joTl). The issue is complicated by the mixing of E and B modes 



when only part of the sky is observed (jisl : Sot 87; 981), which also favours larger sky coverage. 
Since the tensor-mode power is mostly on large scales [l < 200), this lensing noise limit 
can be reached with low-resolution experiments. However, as we shall discuss in Section |HJ 
high-resolution observations can in principle be used subtract most of the lensing signal, and 
therefore obtain significantly better limits on the primordial gravitational waves. 



6 Non-Gaussian structure 



Inflationary and other models predict a very nearly Gaussian spectrum of primordial pertur- 
bations. Since evolution to last scattering is essentially linear, the CMB anisotropics should 
also be very nearly Gaussian. Furthermore the lensing potential is most sensitive to large-scale 
modes of the density field which are also Gaussian. On small scales where non-linear evolu- 
tion is important there will be many perturbations along the line of sight, reducing the effect 
of any non-Gaussianity in the lensing potential by the central-limit theorem. However the 
lensed CMB — a non-linear function of the two nearly Gaussian fields — will not be Gaussian. 
In this section we discuss some of the properties of this non-Gaussian signature, assuming 
for simplicity the unlensed CMB and lensing potential are exactly Gaussian. Deviations from 
Gaussianity due to non-linear evolution can be assessed using numerical simulations (jisi : 77). 
We shall not discuss other potentially important non-Gaussian signatures, such as that due 
to the SZ component (jiil). 



6.1 Three-point function (bispectrum) 



For an unlensed sky, the temperature three-point function vanishes because there are an odd 
number of products and the distribution of B is an even function, so (G(li)B(l2)0(l3)) = 0. 
To consider the leading-order non-Gaussian signal for the lensed field we use the lowest-order 
series-expansion result from Eq. ()4.9p : 

9(1) ^ 6(1) - / ^ 1' ■ (1 - - i')e(i'). (6.1) 

The three point function then has non-zero even terms, and we have to lowest order in C^^ |73h 
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(e(ii)e(i2)e(i3)) 



21/ 



d^l 



r • (I3 - l')(e(li)^(l3 - 1')) (6(12)6(1')) + perms 



27r 



6{h + h + I3) K'^C^.h ■ h + perms) , 



(6.2) 



where 'perms' indicates the five other permutations of li . . . I3. The three point function 
therefore directly probes the large-scale temperature-lensing potential correlation. The cross- 
correlation on large scales is due to the integrated Sachs- Wolfe contribution to the primary 
anisotropy. This comes from low redshift, and hence is correlated with the matter fiuctuations 
contributing to the lensing potential. The bispectrum can therefore be used as a probe of the 
perturbation growth and expansion history of the uni verse at lo w r edshift, and he nce can be 
used to help constrain the dark energy and curvature ( lOCll : 101 : 21] 1021 : 1031 : 104 ). 



On small scales where all the 1 are large the cross-correlation is negligible (see Fig. and 
the three point function vanishes even for the lensed sky. Note that because the lensed field is 
linear in 6, the three point function is exactly zero if there is no cross-correlation — this result 
does not depend on the series expansion. On very small scales our assumption of Gaussianity 
of the lensing potential will not be quite correct and there may be additiona l contribu tions 
to the bispectrum because of non-linear evolution (and hence skewness) of ijj (jlOSl : Il06t ): the 
distribution must be skewed in non-linear theory because the matter densities are constrained 
to be larger than zero. Late-time non-linear effects other than lensing may also introduce a 
non-zero bispectrum, in part i cula r thro ugh correlations of CMB secondaries with the lensing 
potential, especially SZ (jlOli llCrzi ITosh . 



6.2 Four-point function (trispectrum) 



The four-point function is the first higher-order correlation function that has an interesting 
signal on small scale s where t he lensing potential can be taken to be statistically independent 
of the temperature For the Gaussian unlensed fields we have 



(6(11)6(12)6(13)6(14)) = C®C®5(li + h)5ih + I4) + perms. 



(6.3) 



The non-Gaussian signature is therefore in the connected part of the correlation function that 
vanishes for Gaussian fields u2, 



(6(11)6(12)6(13)6(14)). = (6(11)6(12)6(13)6(14)) - [C«C«5(li + l2)5(l3 + I4) + perms 
^ ^^^(li + I2 + I3 + U) [cf^+i3|CgC,^(li + I3) ■ I3 (I2 + I4) ■ I4 + perms; 



(6.4) 



where 'perms' denotes all permutations of li . . . I4. The four-point function therefore probes the 
lensing potential power spectrum. It should be clearly disting uish able from cosmic variance 
for observations with high sensitivity at / > 1000 (see Ref. (jliol l). We shall return to the 
issue of extracting information about the lensing potential from non-Gaussian signatures in 
Section [3 
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The n-point functions can also be evaluated in real space, for example the connected four point 
correlation is (G(xi)0(x2)G(x3)0(x4))c. These forms are often more convenient for partial-sky 
observations as they are manifestly local. However the number of different pixel correlations 
is very large, and computing all possible quadrilaterals would be practically impossible. For 
this reason practical estimators tend to conce ntrate on particular subsets of geometries, for 
example when two points are the same (jl09f ). For lensing purposes the main interest is in 
reconstructing the lensing potential, and methods adapted for this purpose are described in 
Section [7| 

Secondary anisotropics such as SZ will have some correlation with the small-scale lensing 
potential, int rodu cing additional non-Gaussian terms and modifying the expectation for the 
trispectrum ()l08| ). 



The fact that lensing induces a non-zero four-point function also changes the covariance of 
lensed power spectrum estimates relative to the uncorrected Gaussian expectation (lAQp) = 
'iCf / (2/ + 1). This is essentially because the large scale lensing modes correlate the C/, reducing 
the effective number of independent 0;^ observations. The effect is however rather sm all (~ 
2%) at Planck sensitivity, though potentially becoming more important on smaller scales (|llCl ). 
The effect is, however, significant for i?-mode polarization as we describe in Section 



6.3 Connected n-point correlation functions (cumulants) 



For a Gaussian field the n-point correlation functions are equal to their disconnected parts 
for n > 2. The connected part of the functions for n > 2 therefor e defi ne an infinite series of 
higher-order non-Gaussian signals, also known as the cumulants (jllll ). Since the lensed field 
is linear in the unlensed G, all the odd-cumulants are zero both with and without lensing 
(neglecting small large scale cross-correlations). The even cumulants for n > 6 depend at 
0{Cf) on n > 2 cumulants of the lensing potential, which are zero if the lensing potential is 
Gaussian (|l08[ ). However there will be non-zero higher-order terms. 



6.4 Polarization 



The polarization fields are essentially uncorrelated with the lensing potential so the n-point 
functions with n odd are zero . The trispectrum for £^-mode polarization is similar to that 
for the temperature anisotropics: 



^ There will be a tiny correlation from polarization generated by CMB quadrupole scattering at 
clusters. 
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(£^(ll)^(l2)^(l3)^(l4)). 



2(27r)2 



6{h + I2 + I3 + I4 

■4> riEriE 



Cl+uCtCtWE{\i. -l3)W^i.(l2, -I4) + perms 



(6.5) 



where ^^^(l, 1') = 1' ■ (1 — 1') cos2(0i — (py) and we have ignored any unlensed 5-modes. For 
the lens-induced 5-modes, at leading order these are first order in so the trispectrum is 
0{Cf f] we find 



{B{\{)B{h)B{h)B{\,)), ^ -5(li + I2 + I3 + I4) 



X 



d^L 

(2vr)4 



Cl qt+i3-L|CrL|C+L|W^B(li, L)W^b(12, 



X Weih. li + I3 - L)iys(l4, L - li - I3) + perms] , (6.6) 

where V^b(1, 1') = 1' ■ (1 — 1') sin2(0i — 01/). The lensed E and B modes still have opposite 
parity and so statistical parity invariance (with isotropy) demands that they are (exactly) 
uncorrelated. However, they are no longer independent and all of the mixed cumulants (i.e. 
those with one, two or three lensed -B-modes) are non-zero. We shall only give the result 
here for the cumulant involving two lensed E'-modes and two lensed S-modes since this is 
responsible for correlating the E and 5-mode power spectra. At leading order we have 



{E{\{)E{h)B{h)B{U)), 



(27r) 



-5{\i + I2 + I3 + I4) 



X Qfc.t [ql,+i3|H^s(i3, -\i)Wb{U. -I2) + qT,+i3|W^s(i3, -12)^^(14, -li)J . (6.7) 

Trispectra can also be formed between the temperature and polarization fields but we shall 
not consider these here. For a discussion of the spherical generalization of the polarization 
trispectra, see Ref. JiiJ. 

As with the temperature, the non-zero trispectra alter the covariance of polarization power 
spectra estimates. The most imp ortan t effect is in the covariance of the 5-mode power since all 
of the signal comes from lensing The flat-sky covariance of the lensed B-mode spectrum 

is approximately 



coviCr,Cp 



B\2 



21 



,d0,T,(I, -1,1', -1') 



(6.8) 



where the connected trispectrum Tc(li, I2, 13, 14) is given by the factor on the right of Eq. ()6.6|) 
that multiplies the delta function. The presence of this term increases the diagonal of the 
covariance and correlates the power spectra over a broad range of scales (|ll3l : lH^. It is 
essential to include properly this non-Gaussian covariance in parameter estimation from future 
lensed 5-mode spectrum estimates, both to avoid under-estimation of the errors and biases in 
the estimated parameters The number of independent 5-modes produced by lensing is 

lower than if the fleld were Gaussian and this reduction demands that (more) careful attention 
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be paid to the approximate form of the hkehhood that is assumed for parameter estimation 
to avoid further bias. 



6.5 Other non- Gaussian signatures 



Isotropic Gaussian random fields have many special properties , deviations from which consti- 
tute many different possible non-Gaussian signatures (113; Eli). The lensing signal is expected 
to show up in many of these. For example the hot and cold spot ellipticities have a very specific 
probability distribution for Gaussian fields 

(El), 

so additional ellipticities caused by lensing 
shear should show up clearly as an excess of more elliptical shapes (jish. Likewise the hot and 



cold spot correlation function should be changed in a characteristic way ()117l : 111 



6.6 Distinguishing primordial non-Gaussianity 



Lensing changes the spatial correlation structure of the CMB field, but does not change the 
distribution at a point. It follows that any primordial non-Gaussian signature in the one- 
point function will be preserved by lensing: points are moved around, but the temperature 
distribution of the points is unchanged for an observation with infinite resolution. However 
beam effe cts do com plicate the analysis, so lensing does change the beam-convolved one-point 



function (lllUin 



The three-point function (bispectrum) from lensing ideally only has a cross-correlation signal 
on large scales, but in practice has an additional signal from SZ-lensing correlation on small 
scales. These are important sources of confusion fo r a p rimordial bispectrum, but may be 



distinguished by their distinctive s cale dependence (|1Q7D. The lensing also has a significant 
four-point function (trispectrum) ()ll9l ). but again could be distinguished from primordial 
signals by the approximately known amplitude and different scale dependence (|l20l ). The 
non- zero lensing trispectrum also means that the variance of the bispectrum is increased from 
the Gaussian result: le nsing makes it harder to distinguish a small scale primordial bispectrum 



from cosmic variance (|l2ll ). 



Confusion with kinetic SZ and point source signals may be more important than lensing 



though different sources should have different scale dependence (llOTi ). The CMB polarization 
may be much cleaner (SZ signals are higher order), and if sufficiently sensitive CMB polar- 
ization data can be used some primordial non-Gaussian signals may ultimately be lensing 
limited. However since non-Gaussian scalar modes only produce linear ii^-modes, if precision 
-B-polarization data is available it may be possible to use the information in the B modes to 
reconstruct the lensing field (see Sec. [Tj), effectively removing some of the non-Gaussian signal 
due to an unknown lensing potential. 
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7 Reconstructing the lensing potential 



A fixed large-scale lensing potential will lens smaller background unlensed CMB anisotropies in 
a characteristic way. By measuring a large number of these anisotropies it should therefore be 
possible to extract information about the lensing defiection field. The unlensed CMB field is, 
of course, unobservable, but its statistics are very well understood. Statistical measures of the 
lensed CMB fields should therefore be able to constrain the lensing potential 
On small scales, where there is little unlensed CMB power, the lensed CMB measures the 
small-scale lensing potential more or less directly. 



The lensing potential can also be par tly r e construct ed using large-scale structure tracers such 
as galaxy lensing or 21cm emission ( 124 : Il25l : Il26t ). The 21cm power spectrum is relatively 
featureless, and hence contains relatively little information from a given redshift about the 
lensing potential. However the po ssibi lity of using different redshift slices in principle allows 
21cm reconstruction to be useful (Il2( 



Here we concentrate on reconstruction from the CMB temperature and polarization, and 
assume the primordial fields are Gau ssian (if there is a primordial non-Gaussian signature 
this can complicate the analysis ()ll9| )). In this section we discuss various ways of estimating 
the lensing potential. In Section |H1 we discuss the application to delensing and how accurately 
it can be done in principle and practice. Applications to cluster CMB lensing are discussed 
further in Section Hi. 31 



7.1 Temperature quadratic estimator 



For a fiat-sky statistically-isotropic ensemble, harmonics with different 1 are uncorrelated, 
(9(I)6*(I')) = 5(\ — V)Cf . However the actual sky we observe is of course not isotropic, and 
in particular for a given fixed lensing potential, the distribution of the observed temperature 
will not be isotropic. This suggests that we may be able to use the quadratic off-diagonal 
terms of th e '0- f ixed correlation (B fQQflQje to constrain the lensing potential in our sky 
realization ()l23l : 127 : llld: Il28l : Il29i ). This essentially amounts to using correlations between 



lots of temperature anisotropies around each lensing clump to constrain the potential of the 
clump. 



We use the series expansion in the deflection angle (Eq. ()4.9|) ) to lowest order in the lensing 
potential. Averaging over realizations of the unlensed temperature fleld 9 gives 
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(e(l)e*(l-L))e = 5(L)Cf - 



d^r r 



2tx 



(1-1')^(1-1')(0(1')0*(1-L)) 

+ 1' • (1 - L - - L - i')(0(i)e*(i'))l + 



met 



e 



2ti 



(L-l)-LC|tL| + l-LC'f ^(L)+0(^ 



(7.1) 



The L 7^ correlation therefore probes the lensing potential. The ip{L = 0) mode of the lensing 
potential is not observable (zero gradient), so below we implicitly consider only |L| > 0. To 
estimate the lensing potential on our sky we perform a weighted average of the off-diagonal 
terms, defining the quadratic estimator 



^(L) ^ iV(L) 



d^l;. 



2tt 



e(i)e*(i-L)(7(i,L) 



(7.2) 



where (?(!, L) is some weighting function. In order for the estimator to be unbiased at lowest 
order we want ('?/'(L))e = "^(L), so the normalization is 



i2ny 



L-i)-LqV-Li + i-LC^z® giW- 



(7.3) 



We are then free to choose g to maximize the signal to noise. To zeroth order in Cf the 
variance is given by {\ip(L) — ^^(L)^) ~ (|'?/'(L)p) and the non-Gaussian connected part of the 
four-point function (Eq. ()6.4|) ) can be neglected. We then have 



(V^*(L)^(L'))=<5(L-L')2iV(L)^ 



d^I . 



(2vr; 



(7.4) 



where C*°* = Cf + A*"; ar id N i is the noise contribution. Minimizing the leading-order variance 
gives a weight function (|l23l ) 



^7(1,L) 



[L-l)-LqtL| + l-LQ® 



o/^tot/^tot 



(7.5) 



Here we have chosen the arbitrary normalization of g so that the lowest order 'noise' on the 
reconstructed potential is determined by 



<5(0)(|^(L)|^)-^ = iV(L 



d^I 



i2n) 



(7.6) 



The delta function here should be interpreted in terms of the sky area in the usual way: 
m) = /sky/vr. 

These results allow us to estimate the lensing potential to within some 'noise' determined by a 
combination of cosmic variance from the finite number of temperature modes and observational 
noise. Note that the estimator is only valid to lowest order in the lensing potential; this 
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means that in reality the estimator is biased due to the significance of higher-order terms. In 
particular the 0{Cf) terms we have neglected will be significant when attempting to estimate 
the lensing potential power spectrum from the variance. These terms need to be ac coun ted 
for via a correction to get unbiased results ()l29[ l. and a full analysis is given in Ref. ()l3Cll l. 



The estimator can be generalized to the curved sky ()128[ ). but for small sky areas curved-sky 
corrections are likely to be small compared to the neglect of higher order terms. Note also 
that this is just a quadratic estimator, it is not the optimal estimator — see the discussion of 
Bayesian methods below. 

Combining the above results, the harmonic-space result for the estimator becomes 

The integral is a harmonic-space convolution, which may therefore be written alternatively as 
a real-space product, giving ()l23l ) 



^(L) = -^iV(L)L- /^e-^^''Fi(x)VF2(x) 
J Zrr 

= -iV(L) J ^ e-^^-^V ■ [Fi(x)VF2(x)] (7.8) 
where we defined two filtered fields given in harmonic space by 

This tells us that the estimator of the potential is related to correlations between the gradient 
of the temperature (encoded in VF2(x)) and the small-scale filtered anisotropy Fi(x). These 
should be correlated because on small scales the unlensed CMB is approximately a temperature 
gradient, and the small-scale lensed anisotropics come from disturbing this gradient with the 
deflection angle from the lensing potential. The partic ular choice of filters improves the signal 
to noise over similar estim ators used in earlier work (Il22| ). This result can be generalized to 
the curved sky case (|l28[ ) for application to large-area observations. The representation of 
the estimator in terms of real-space products of filtered fields is numerically efficient because 
the integrals are now in the form of Fourier transforms which can be performed quickly using 
FFTs. On the curved sky they can be performed in terms of spherical harmonic transforms, 
which are still quite efficient because the azimuthal integrals can be performed using FFTs. 

In practice the reconstruction may not be so straightforward. In addition to the error from 
the use of the series expansion, the lensing signal can also be complicated by contributions 
from the kinetic-SZ effect, and non-linear evolution may be important on small scales. If not 
taken into account these can s ignificantly bias the result obtained from the simple quadratic 



estimator derived above (jlSOt 1771 ). The kinetic SZ signal comes predominantly from large 
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:1I 



Fig. 15. A simulated reconstruction of the magnitude of the deflection angle from Ref. (1123). The left 
panel shows the deflection from the realization used in the simulation. The centre panel shows the 
field reconstructed using the temperature quadratic estimator. The right hand panel shows the better 
result from using the EB quadratic estimator. The simulation is for an idealized CMB observation 
with isotropic white-noise variance = Nj^ /2 = Nj^ /2 = 8.5 x lO^^/^K^ iV^F = 1/uK-arcmin) 
and beam full-width half-maximum of 4arcmin. The frames are 10° x 10°. 



clusters, so masking out these sources using thermal SZ measurements can help to reduce this 
source of reconstruction error (at the expense of slightly reducing the lensing signal) ([ttI ). 



7.2 Polarization quadratic estimators 



Precision observations of the lensed CMB polarization field h ave the potent i al to improve 



greatly the lensing reconstruction from the temperature alone (|131 



1271; [1281; |27D. The 



reason why the polarization is potentially much better is that the sky should have zero TB 
and EB correlations without lensing: they have no cosmic variance from the unlensed fields on 
small scales. These terms can therefore be used to construct good quadratic estimators for the 
lensing potential along the lines of the temperature estimator. In addition, lens-induced B- 
modes typically arise from smaller-scale lenses than for temperature, allowing reconstruction 
of the lensing potential out to smaller scales. 



For two CMB fields X and Y, the estimators are of the form ()l27l ) 

d^l 



XY\ 



N 



XY\ 



27T 



X(l)r*(l-L)(?xy(l,L) 



(7.10) 



The BB estimator gives very little information when there are no unlensed i?-modes: to get 
a good estimator we need to correlate the lens-induced B modes with the E modes. For the 
case X = E and Y = B the minimum-variance estimator has 



gEB(},'L) = sin(20) 



+ (L-1)-Lqf_i^l 

r^tot-B/^tot-B 
"-^ I ll-LI 



(7.11) 



where (j) is the angle between 1 and L — 1 and on small scales we expect the unlensed CP = 0. 
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Fig. 16. A simulated reconstruction of the lensing convergence k from Ref. (|23) using CMB polariza- 
tion. The left panel shows a realization of the convergence used in the simulation. The centre panel 
shows the Wiener filtered field reconstructed using the quadratic estimator. The right hand panel 
shows the improved iterative Bayesian estimator of Ref. ((23) • The simulation is for an idealized CMB 
observation with isotropic white-noise variance Nf = Nj^ = 1.7 X 10" WNf = 1.41^K arcmm, 
about 50 times lower than Planck) and beam full-width half-maximum of 4 //K-arcmin. The frames 
are 8°32' in width, and show only the I < 1600 modes for clarity. The Wiener filtered field appears 
smoother because the filtering suppresses the poorly recovered small-scale modes. 



The corresponding lowest-order noise is given by 



5m\^EBm')-' = NEB{'L) 



1 ■ L + (L - 1) ■ L qf_L|l sin2(20) 



(2vr) 



r^totEntotB 
'-'I |l-L| 



(7.12) 



As for temperature, this weighting is optimal only for the leading-order part of the variance 
(which is what is given in the last equatio n). R esults for the other polarization estima tors take 
a similar form and can be found in Ref. (127), and efficient forms are given in Ref. ()l28l ). In 
the general case all the possible estimators may be combined to give a total minimum-variance 
estimator for the lensing potential. 



For low-noise obser vations the EB estimator turns out to give the highest signal-to-noise 
reconstruction (|l27|). We can see the reason for this in Eq. ()7.12j) — for low noise levels C^°^f 
is small, and given entirely by the lensing contribution to the 5-modes, so the noise is small. A 
simulated reconstruction comparing the temperature and EB results for a simulated low-noise 
observation is shown in Fig. ^1 



One might wonder why it is the lensed fields that are determining the noise level. This is in 
fact only a property of the quadratic estimator: more general methods have cosmic variance 
determined essentially by the unlensed fields. The unlensed B modes are expected to vanish 
on small scales, so in this case the noise on the reconstruction should be tiny. We therefore 
now consider a better, more Bayesian, analysis of the reconstruction problem. 
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1.3 Bayesian methods 



Assuming the unlensed fields, lensing potential and noise are all Gaussian, we can write down 
exactly what the probability of a given observation should be. From this we can use Bayes' 
theorem to solve for the posterior distribution of the fields given the observed data. This 
probability function is all we can ever know about the fields, and therefore contains the most 
information we can hope to extract. Useful estimators such as the maximum-likelihood can in 
principle be found once the full probability function is known. 

We write the unlensed CMB field as a vector in pixel or harmonic space, X. Here X is in general 
taken to have components for the temperature, E and B polarization fields. The observation 
consists of the lensed fields plus noise, X*°* = X + n. Given a theoretical covariance of the 
unlensed fields S = (XX'I") determined by the power spectra, the posterior distribution of the 
lensing potential and unlensed fields is given by 

-21ogP(^,X|X*°\ S,N) = [X*°* - A(V')X]fAr-i[X*°* - A(V^)X] - 2 logP(V^, X|5) + const. 

(7.13) 

Here N is the (assumed Gaussian) noise covariance and we have defined the lensing operator 
A{iIj) so that X = A('?/')X. In real space A[ijj) just represents a point re-mapping matrix 
determining how points move around under the lensing deflection angle. Note that A{tp) is 
a non-linear function of ip unless we make the (in general poor) first-order series-expansion 
approximation. In the absence of polarization observations the prior is given by 



-2iogP(v^,e|5) = et 




(7.14) 



assuming isotropic Gaussian unlensed fields. The Q-Tp correlation is about 50% for the quadrupole, 
but falls to less than 1% by I ~ 150 (see Fig. E}. The prior therefore contains significant infor- 
mation about the potential given the large-scale unlensed temperature (which is very close 
to the lensed large-scale 0). From now on we neglect the information about the potential 
that can be learnt directly from the correlation with the large scale temperature. We shall 
concentrate instead on what can be learnt from the lensing signal, taking the temperature and 
lensing fields to be uncorrelated. 

For a fixed potential, the posterior distribution of the unlensed field X is Gaussian (the log 
probability is quadratic in X as long as the prior is Gaussian). If we are only interested in 
reconstructing the lensing potential, we can integrate out the unlensed field X. A simple way 
to get at the answer is to consider averaging over X and the noise n, but keeping the potential 
fixed, to give the variance 

Cg = (X*°*X'°*t)x,n = A(^) (XX^) A(^)t + N = A(^)5A(^)t + N. (7.15) 

Marginalized over noise and unlensed X realizations we therefore have 

-21ogP(^|X*°*,5, AT) =X*°*1'C3'^X*°' + lndetCg-2logP(^|5). (7.16) 
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This can also be obtained by a rather longer analytical integration of Eq. ()7.13|) . In principle 
this equation tells us all we need to know about the posterior distribution of the lensing 
potential. However the marginalized posterior distribution is now a complicated non- Gaus sian 



function of ip, so even finding the maximum-likelihood point is non-trivial. Refs. (|l33l : |27|) 
develop an approximate iterative scheme for solving for the maximum-likelihood point and an 
estimate of the local error. A more fully Bayesian approach might be to try to use sampling 
techniques to probe the full posterior distribution. However sampling from high- dimensional 
non-Gaussian distributions is in general difficult, and currently no workable methods have 
been developed. 

Since the Bayesian approach is optimal (uses all the information) we could expect it to be 
better than the quadratic estimator. Indeed this is the case, with the maximum-likelihood 



Bayesian estimator being superior in all cases fjl33tl27l ). For the temperature the improvement 
is actually quite small. However for the polarization it turns out that a Bayesian estimator can 
do very much better. This is essentially because the quadratic estimator has cosmic variance 
from the lensed fields, which are non-zero because lensing induces B-mode polarization by 
lensing E. The maximum-likelihood estimator knows there should be zero S-modes on small 
scales in the unlensed fields, and the cosmic variance error is then essentially zero, significantly 
reducing the noise on the estimator. Fi gure ITBl shows a low-noise simulated reconstruction using 
the iterative method compared to the quadratic estimator. We discuss this in more detail in 
the next section where we consider how well the lensing S-modes can be removed by delensing 
using estimators of the lensing potential. 



8 Delensing the lensed sky 



Analysing the full statistics of the lensed sky is complicated. For the near future, para mete r 



estimation using the lensed CMB power spectrum is likely to be accurate enough (jl3J). 
However a more optimal analysis should account for the full non-Gaussian distribution of the 
lensed sky, or, equivalently, the full posterior distribution of the unlensed sky and lensing 
potential. If the lensing potential could be reconstructed accurately, the sky could be delensed 
simply by mapping points back according to the deflection angle. This would be useful as 
the unlensed sky has much simpler statistics than the lensed sky. The smoothing effec t of 
lensing also potentially hides any small scale structure in the CMB power spectrum f)l35l ). so 
delensing would be useful to uncover this information. 

With polarization observations the lensing effect is especially important: as discussed in Sec- 
tion l5.4l the i?-mode polarization generated by lensing may dominate the signal expected from 
primordial gravitational waves. For full-sky observations, the lensing 5-modes act as a source 
of noise and limit primordial tensor-mode detections to ratios r > O{10~^) ([oTt Issh . If only 
part of the sky is observed the lensing variance is larger and the limiting ratio is higher. It is 
possible that S-modes will be detected at r > 10~^, however they could easily be smaller. For 
future 5-mode polarization observations delensing will be critical to do better than this; the 
ultimate observable limit on tensor modes may well depend on how well the lensed S-mode 
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signal can be subtracted, assuming that instrument and astrophysical foregrounds can be ad- 
equately cotrolled. Even if a larger B-raode signal is detected, we will probably still want to 
remove the lensing confusion, for example to measure the spectral index and amplitude to 
better precision and test for Gaussianity. Since B-mode subtraction relies on using small-scale 
anisotropies to reconstruct the lensing potential, future observations aiming at low tensor 
amplitude will need to have high resolution as well as high sensitivity. 

Although delensing (if doable) would be useful, if we can model the full statistical distribution 
of the lensed sky accurately enough it is not actually needed: an optimal analysis could be 
done by using the full true likelihood function. However in practice this is likely to be very 
difficult, and delensing methods would in any case be very useful as a way of identifying 
deviations from the expected model. 

With observations of just the one lensed sky is it possible to learn about both the unlensed 
sky and the lensing potential? An observation of the CMB temperature in N pixels provides 

just N numbers, so clearly in general it is not possible to solve exactly for the 2N numbers 
needed to describe both the unlensed temperature and the lensing potential fields. However 
since we know a lot about the statistical properties of both fields, it should be possible to 
constrain both fields statistically. In particular, for the temperature we could ideally assume 
that there is very little unlensed power on small scales, so the unlensed CMB is close to a 
pure temperature gradient. Assuming we can measure the background gradient at some point, 
the lensing defiection angles aligned with the gradient can then be reconstructed exactly by 
noting that the change in the observed temperature due to lensing is AO(x) = V© • a(x). 
Orthogonal defiection angles cannot be reconstructed, so we can only measure half of the 
possible information in the deflection fleld. However the deflection angle fleld is expected to be 
the gradient of the lensing potential, a = V'?/^, so its curl is zero: in harmonic space the lensing 
signal is A0(I) = iV0 ■ Hence all modes of the lensing potential can me measured 

except for the zero- measure modes with orthogonal 1 so that V0 - I = 0. In the presence 
of noise modes with 1 nearly parallel to the direction of the gradient will be constrained 
much better those nearly orthogonal. On larger scales our prior is that the unlensed CMB 
is a Gaussian field with an (approximately) known power spectrum. Any deviations from 
consistency with this prior gives extra information that can be used to constrain at least some 
modes of the lensing potential. For example a large-scale lensing mode would correlate the 
shapes of background anisotropy blobs over the scale of the mode. 

When polarization information is available the situation is much improved. To see this on small 
scales, consider the damping tail where both the primordial E and 5-modes are expected to be 
very small. There are now two unlensed gradient fields, the gradients of the Q and U Stokes' 
parameters. In some fixed basis the excess in the lensed fields are AQ{^x) = 'VQ ■ Q:(x) and 
A[/(x) = VC/ • Q:(x) in the gradient approximation. Since the gradients are random they are 
generally not aligned, and hence both components of the deflection angle can be measured 
without assuming that the defiection field is a gradient. In this case, the curl-free property of 
the defiection angle field is then a consistency check. The exceptions are the measure-zero cases 
when the two gradients are aligned, in which case the analysis is equivalent to the lensing of 
the temperature anisotropies described above. On larger scales the story is more complicated. 
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However, since we expect there to be no unlensed _B-modes (except possibly on scales above a 
degree or so) the situation is still much better than with the temperature: if we assume there 
are no unlensed -B-modes, we only have to solve for the unlensed Gaussian E field and the 
lensing potential. Since we observe both the lensed E and B fields there should be enough 
information to solve for both if we assume a curl-free deflection angle. The number of modes 



that cannot be recovered have measure zero (j27|), for example when the unlensed polarization 



happens to be zero over a finite area. Furthermore the polarization ii^-mode power spectrum 
is skewed to smaller scales than the temperature, so there are effectively more background 
shape-correlated CMB blobs available to constrain the large-scale lensing modes. This reduces 
the noise on large-scale mode reconstruction from the lensed non-Gaussianity, though on small 
scales this effect is compensated by the fact that small-scale power acts as a source of correlated 
noise on top of the otherwise clean unlensed background gradients. 

How well can the lensing i?-mode signal actually be subtracted? Early work on subtraction 
methods applied quadratic estimators to reconstruct the lensing potential, and then used the 
estimated lensing potential to delens the sky. These method s ind i cate d tha t the lensing power 



can be reduced by a factor ~ 10 in the zero-noise limit ()136l: Il37t 11381 ). Although nearly 
optimal for large noise levels, as discussed in the previous section the quadratic estimator is 
not optimal due to the unnecessarily large cosmic-variance error. 



It was pointed out by Refs. ()27t 11381 ) that in fact one should be able to do much better than 
the quadratic estimator. The reasoning is as follows. The primordial 5-mode signal peaks at 
/ ~ 100, and falls off on smaller scales (see Fig. IT^. So we are really interested in subtracting 
the lensing signal only on large scales, / < 200 say. The i?-modes on smaller scales, peaking 
at / ~ 1000, will be dominated by the lensing signal. There is therefore a very large amount 
of unambiguous information available in the 5-modes at / > 200, which can be used to 
reconstruct the lensing potential. Since the lensing potential peaks at / ~ 60, fairly large-scale 
lensing modes will be responsible for most of lensing signal. The large-scale modes in the 
lensing potential can therefore be constructed extremely accurately using the large number of 
small-scale lensed B-mode observations. 

If the low-/ 5-mode lensing signal had no power from small-scale lensing modes, the recon- 
struction from lowest-order lensing might be exact in principle: there would be an essentially 
infinite number of small-scale 5-modes available to reconstruct a small finite number of large 
scale lensing potential modes. In fact the low-/ i?-modes are sensitive to small scale power 
in the lensing potential (~ 10% of comes from Cf at / > 1000 — see Fig. Ill|) . so the 
actual limit in practice is not obvious. In reality the improvement over th e qu adratic esti- 



mator methods depends on the noise level. For low noise levels, Refs fl27l : 11381 1 claim that 
at least a factor 40 delensing reduction in the lensing power using their approximate iter- 
ative maximum-likelihood method. The ultimate limit for ideal low noise experiments may 
therefore approach r ~O(10~^), depending on the optical depth, corresponding to a poten- 
tial V*^"^ ~ 10^^ Ge V fll38[). In rea lity foreground and experimental issues are likely to be a 
si 



ignificant obstacle ( 139[ 



1401: 11411 



Technically the posterior distribution of the unlensed sky is given by Eq. (|7.13p . integrated over 
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all possible lensing potentials. However the lensing matrix A(?/') is a non-linear function of if), 
and so cannot easily be integrated analytically. We must either resort to approximations like 
those discussed above, or use sampling methods. One potentially pr omis ing sampling method 
is Gibbs sampling, applied to the simpler unlensed case in Ref. ()l42| ). The method works 
by iteratively sampling from the marginal distributions, here P(X|'0,X*°*) and P('?/'|X, X*°*). 
Asymptotically it can be shown that a random sample from this iteration corresponds to a sam- 
ple from the joint distribution P(X, ^/'IX*"*) (except in pathological cases), though convergence 
can be arbitrarily slow. Gibbs sampling requires being able to sample from the marginal distri- 
butions; this is fine for P(X|?/;, X*°*) since this distribution is Gaussian, however P(?/'|X, X*°*) 
is non-Gaussian. One possible approximation is to use a series expansion in the deflection 
angle. However unfortunately the two marginal distributions P(X|'?/', X*°*) and P('?/'|X, X*°*) 
are in general both much narrower than the marginalized distribution P(X|X*°*): given a par- 
ticular lensing potential the delensed sky is given essentially by a delta function. This means 
that naive Gibbs iterations will not converge within a reasonable time. At the time of writing 
there are no known practical methods for sampling from the full posterior distribution. 

Other tracers may also be used to learn about the lensing potential and hence delens the CMB 
signal. In particular high-redsh ift 2 1-cm observations could be a powerful way to improve 
tensor-mode limits in principle (jl25[ l. 



9 Sky curvature 



The fiat-sky approximation is very useful in that it makes calculations simpler and is likely to 
be quite a good approximation for observations covering only a small fraction of the sky. How- 
ever it may be inadequate for observations covering a significant fraction of the sky. Further- 
more, even though the lensing deflections are tiny ((9(10~^)), they are correlated over degree 
scales. This relatively large-scale correlation suggests that corrections due to sky-curvature 
may be non- negligible even on quite small scales (jc^). The P-modes induced by lensing of 
P-modes are of most interest on large scales where they confuse the tensor-mode signal, so it 
may be important to include the effects of sky curvature. 

The temperature field on the sphere can be expanded in spherical harmonics Yi^. The deflec- 
tion angle and polarization fields can be expand ed in terms of spin ±1 and spin ±2 spherical 
harmonics ±iYim and ±2^«m respectively (8^ 134), as defined in Section lB.2.21 Equivalentlv the 



fields could be expanded in terms of vector and tensor spherical harmonics (j88f l . The variance 
of the harmonic coefficients then defines power spectra for the fields in the usual way, with 
the Ci being independent of m by the assumption of statistical isotropy, e.g. 

(0Zm0/*m') = ^W^mm'Cf' . (9.1) 

The temperature correlation function is defined by 

m ^ (e(n0e(n2)), (9.2) 
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where (3 is the angle between the two directions (fii ■ n2 = cos/5). 

The effect of sky curvature on the 0{Cf) s eries- expansion calculation of the lensed CMB power 
spectra was given in Ref. (E^j ) (see also (|l43[ )). and shown to be above cosmic variance 
However as discussed in Section HI the perturbative calculation is not actually accurate, and 
the series-expansion approximation is in fact much worse than the neglect of curvature. An 
accurate result can however be calculated using the full-sky correlation functions which 
is the approach we follow here. 

Since deflection angles are small we can treat these accurately with flat-sky approximations, 
but we must use curved-sky results for terms involving larger scale correlations. On the sphere 
it is simplest to describe the correlation of the deflection angles in terms of the spin-1 deflection 
field la = OL ■ {bq + «e^), where and are the unit basis vectors of a spherical-polar 
coordinate system. Rotating to the basis defined by the geodesic connecting fii and £12, the 
spin-1 deflection (denoted with an overbar in the geodesic basis) has real and imaginary 
components 

«! cos'i/'i = 3?ia(ni), ai sin?/^! = $5ia(ni), (9.3) 

and similarly at n2. Here, ai = |Q;(ni)| is the length of the lensing displacement at fii and 
ijj is the angle it makes with the geodesic from fii to n2. In terms of these angles the lensed 
correlation function is 



Im 

= E C^dl^,{f3){YUc^u^i)YUa2,^2)). (9.4) 



Imm' 



Here dl^^,{(3) = D^^/(0, /5, 0) are the reduced Wigner functions (see e.g. Refs. ( 1451 : 146t )). The 



easiest way to see the last step is to put fii along the z-axis, and fi2 in the x-z plane so that fi'^^ 
has polar coordinates {ai,ipi). The harmonic at the deflected position fi2 can be evaluated by 
rotation: Yim{'n.2) = [D^^ {0, P , 0)Yim]{(y2, ^'2) , where [DYimHii) is a spherical harmonic rotated 
by the indicated Euler angles. As usual we have neglected the small correlation between the 
deflection angle and the temperature so that they may be treated as independent fields. The 
remaining average is over possible realizations of the lensing field. 

The last line of Eq. ()9.4p gives the correlation function in terms of a long-range curved-sky 
term {dl^^,{P)) and a covariance that only depends on deflection-angle scales. This suggests 



It is necessary to reduce theoretical errors that are systematic (i.e. of constant sign) over a range 
of scales to well below cosmic variance on an individual Q. This is to avoid bias in parameters that 
are sensitive to a broad range of scales. By 'cosmic variance' here we mean the error in a bin of 
some width in / comparable to the range in which the effect is important. Generally AC//C/ <^ l/l 
is suf ficien t for bias on an amplitude parameter to be small compared to its posterior width (c.f. e.g. 
Ref. (jlil ). 
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that we can use a small-angle approximation to evaluate the latter t erm. For small a the 
spherical harmonics can be approximated to within (9(1//^) factors as ()l46[ l 



47r 



6l e^^'^e*'-™^'*-'''^^ , 



V 47r 27r jo 

where L = (/ + 1/2)(cos0l, sIu^l) and a. = a(cos'?/', sin?/'). Hence 



(9.5) 
(9.6) 



4^ -mm. .',,^.2 

I mm' V / 

The expectation can easily be calculated as in the flat calculation using Eqs. ()4.2nj) and ()4.26|1 : 



,j(L-ai-L'-a2)\ ^ g-((L-ai-L'-a2)2>/2 ^ g-L2[Cgi(0)-Cgi(/3) cos{</,L-0L')+CgL2(/9) cos(</.l+</'l/)]/2_ (^g^^ 



The angular integrals can now be performed to get give modified Bessel functions involving 
Cgi(O), Cgi(/3) and Cgi,2(/5), where on the full sky we now have 



2/ _|_ 1 

Cgi,2(/3) = (ia(ni)ia(n2)) = - E ^/(/ + l)Cf rfLn(/5), 

9/ -I- 1 

Cgi(/3) = (ia*(nOia(n2))=E^^(^ + l)CMi(/5)- 



(9.9) 



The fully non-perturbative result (new to this paper) is 



I mm' 



2 



(9.10) 



where m + m' is even and we defined cr^(/?) = Cgi(O) — Cgi(/3). Unlike the fiat result, this result 
depends on Cgi(/9) explicitly as well as via cr^(/?). In the large / limit it is consistent with the 
fiat result of Eq. ()4.29|) because (/?) J2u{W) and we then have 



(9.11) 



/J = — oo 



which follows from setting = vr in Eq. ()4.28|) . This means that contributions from Cgi{f3) 
can only be important at low /. At low / the term is close to 5^0 because Cgi(/3) is small, 
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Fig. 17. The fractional difference between the lensed CMB power spectra using the flat-sky approx- 
imation compared to the full-sky method (using the second-order result given in Ref. ItqI)). 

suggesting that we can neglect the Cgi(/5) dependence entirely. This is indeed a good approx- 
imation. Another way to see this is by noting that deflections angles that do not change the 
unlensed separations must have no effect: equal deflections along the direction of the connect- 
ing geodesic cannot affect the correlation function, and similarly for non-parallel deflections 
leaving the separation invariant. In the flrst-order a approximation this means that the un- 
averaged correlation function can only be a function of a.i — 0L2 (for any /?), though at higher 
order there will be non-zero terms in 0:1 + 0:2. Since our approximation for the ^^im is only 
flrst order in a, we can consistently neglect any higher-order dependence in Q:i + f5;2, as we also 
can at high / where the flat-sky approximation applies. Since explicit Cgi(/?) dependence only 
enters when there are cii + 02 terms present, it is safe to neglect it (a zeroth-order expansion 
in Cgi(/3) should be accurate). 



At our level of approximation we can take = (/ + 1/2)^ = Z(/ + 1) + 1/4 
lensed correlation function is then given to good accuracy by 



/(/ + 1), and the 



e-E^C7fe-'('+^)-V2 ^ /4/(/ + l)C7g,2/2]dU 

/ n=—l 

= E + ^^C,,A-. + •■■}• (9.12) 

Here and below we suppress the /? arguments for clarity. The flrst line gives a fully non- 
perturbative result, valid on all scales, that generalizes the flat-sky result of Eq. ()4.29|1 . The 
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last line gives the first terms in a perturbative expansion. The partial flat-sky approximation 
we have used here is very good: compared to the full second-order calculation of Ref. (0) the 
above result if correct to within 0{1/P) factors, giving lensed Ci indistinguishable from the 
full result (differences certainly well below cosmic variance on scales of interest). 

Results for the polarization correlation functions can be worked out using a similar method. As 
for the temperature, we evaluate the polarization correlation functions by taking fii along the 
z-axis and n2 in the x-z plane at angle f3 to the 2;-axis. With this geometry, the polar-coordinate 
basis is already the geodesic basis connecting fii and n2 so that the lensed correlation functions 
are ^ 



e+ = (P*(nOP(n2)) 

= E (Cf + Cf)4„.'(/3)(e-'^^^2l^,:„(«i,V^i)2r,„,(«2,^2)e^^^^), (9.13) 

Imm' 

e- = (P(ni)P(n2)) 

= E (Cf - Q^)c/L^,(/?)(e2^'^^_2y,:;(ai,^i)2y.^,(«2,^2)e^^'^^), (9.14) 

Imm' 

= E Cfdl^,iP){YUauA)2Yin.'ia2,^P2)e''^'). (9.15) 

Imm' 

The additional exponential factors here arise from rotating the spin harmonics at the two 
points from the polar basis to the appropriate geodesic bases; see Ref. (|70| ) for details. Applying 
the same approximations and methods as for the temperature gives the results 



21 + 1 



I 



L ^ E ^(C'f + Q^)e-^('-^^)^ E In m + l)C,,2/2] rfU)(-n+2) 
I n=—l 

= E ^(^f + Q^)e-'('+^)-^/^{42 + + l)Qi,24i + • • • } (9.16) 
I - E ^(^f - Cne~''^^'>'/' E W + l)C,,2/2] 

I n=~l 

= E ^(^f - Cne-^^''-'^-'^'[4-2 + \KI + l)C,i,2 (4-1 + 4-3) + • • • } (9.17) 



21+ 1 



I 



^x^T.^Cfe-'^'-''^'' E 4[^(^+l)Cgi,2/2]4„+i)(_„+,) 



, 4tt , 

I n=—l 



= E ^c^.-"e-'('-^^)-^/^{42 + + l)C^.i,2 (4i + 4-0 + • • • } (9.18) 

where I^^ = 21" — In and dashes denote derivatives with respect to the argument. Note that 
the lowest-order curved-sky results can be less accurate than a higher-order fiat-sky result; for 
numerical work, higher-order terms in Cgi,2 must be included (17(1 ). 
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The lensed correlation functions can be used to calculate the lensed full-sky Ci using the 
relations 
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(9.19) 
(9.20) 
(9.21) 
(9.22) 



The full curvature corrections to the fiat correlation function result are shown in Fig. El The 
corrections are at the small fraction of a percent level for all but the B spectrum, though above 
cosmic variance on small scales. Due to the mode mixing induced by the lensing the corrections 
appear at all scales in the lensed CMB power spectra, and in particular there is a percent 
level effect on the CP power spectrum at all /. Calculation of CP to this precision is difficult 
as it depends on the small-scale non-linear shape of the lensing potential power spectrum, 
so other errors and uncertainties may dominate; however the full-slcy result is numerically 
not significantly harder to calculate than the fiat-sky approximation and hence should be 
used. It is included in the numerical CMB codes camb and cmbeasy . Note however 
that theoretical uncertainties in the recombination ionization history and various late-time 
secondaries are likely to be much more important than the sky-curvature correction. 

Useful relations between fu ll-sky and fiat-sky harmonics and correlation functions are given 
in the appendices of Refs. ()llOl: Il28[ ). Most of the fiat-sky results we have given in previous 



sections can be generalized to the full sky, e.g. the quadratic estimators for the potential (112 
and results for the trispectrum structure (|llCll ). 



10 Simulations 



Simulating lensed CMB maps is straightforward in the lowest weak lensing approximation if 
both the lensing potential and unlensed CMB fields are Gaussian and statistically isotropic: 
make two maps, one of the gradient of the lensing potential and one of the CMB, and a lensed 
map can then be made by remapping points on the CMB map acco rdi ng to the deflection 
vectors o: = Wtjj from the lensing map using X(n) = X(n + a) ()l22l : lisi : Il23l ). The gradient 
of the deflection angle can be taken easily in harmonic space, then transformed to get maps 
of the X and y components. In practice the CMB map needs to be interpolated or generated 
at high resolution in order not to suffer from pixelization issues when remapping the points. 

^^Ihttp : //camb . info" 
http : //cmbeasy . org^ 
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On the full sky a pixel remapping scheme can also be used (|134^ . The deflection vector 
fleld can be constructed from vector spherical harmonics (or equivalently spin-1 harmonics), 
and spherical trigonometry used to calculate the pixel remapping from the geo desic in the 
direction of the deflection vector. A parallelized code based on healpix (|l47l ) is publicly 
available . Simulating lensed skies is signiflcantly slower than unlensed skies because of the 
higher resolution required for the unlensed map, however it is efficiently paralellizable. Note 
that convolution with the observational beam takes place on the lensed sky, so for an accurate 
simulation the unlensed sky must be simulated at high resolution independent of the beam. 

On smaller scales we may need to model the effects of non-linear evolution, perform ray-tracing 
to test the weak lensing approximations, and include other effects (such as SZ) in order to 
test their correlation and confusion with the lensing sign al. I n general this requires doing 
detailed numerical simulations for the density field (|43l : ll48 : 77 : 1491 : llSol ) (for a recent review 



on A^-body codes and references see e.g. Ref. (|l5ll )). The photon path can be computed by 
ray tracing, though in many cases the Born approximation can be used along an undefiected 
path to good accuracy to obtain results which include just the non-linear physics. 



11 Status and applications of CMB lensing 



11.1 Observational status 



At the time of writing there is no detectio n of lensing of the CMB or of lensing-induced 
cross-correlation with large scale structure (|l52| ). This is expected to change very shortly, 
either by direct detection of the non-Gaussian signature in the small-scale CMB, or by the 
improvement to model fitting by using the lensed as opposed to the unlensed CMB power 
spectra. Current data restrict the amplitude of the lensing effect to be within a factor of three 
of the expected result, but consistent with zero. However lensing is a robust prediction, so in 
the next subsections we discuss what new cosmological and/or astrophysical information we 
might be able to extract from future observations of CMB lensing. 



11.2 Cosmological model constraints 



Cosmological parameter analyses using the lensed CMB power spectra (and ignoring the non- 
Gaussian structure) is straightforward using standard Markov Chain Monte Carlo methods 
(e.g. using CoSMoMC^^ (,153,) ). and will probably be adequate for the immediate future. 
Although the lensing effect on the power spectrum at / < 2000 is significant , the effe ct on 
recovered constraints of standard flat-universe cosmological parameters is mild (|l5J;|l3J). For 



^^"http: //cosmologist . inf o/lenspix/ 
http : //cosmologist . inf o/cosmomc/ 
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non-flat models the lensing signal can however be used to break the geom etrical degeneracy, 
allowing a better CMB constraint on the curvature and dark energy (|l55r i. 

With sensitive observations, information in the lensing potential can in principle be used to 
extract extra information about cosmological parameters. For example the unlensed CMB 
anisotropy is only weakly sensitive to neutrino masses sufficiently light that they are relativis- 
tic at recombination. However the lensing potential probes later times, and the small scale 
damping effect on the lensing potential power spectrum can be significant. Ref. (^156.) finds 
that a neutrino with mass 0.1 eV only changes the unlensed spectra at the sub-percent level, 
but the effect on the potential power spectrum is ~ 5%. If the lensing potential can be re- 
constructed to within this accuracy, this extra information can be used to improve the limits 
on the neutrino mass from CMB data alone. Futu re experiments may ultimately be able to 



probe neutrino masses with error of ±0.035 eV (|l57|). Although probably not competitive with 



future Lyman-alpha and galaxy survey constraints, it is an independent check that is much 
less sensitive to non-linear evolution and biasing issues. 

The lensing potential is also a probe of the late-time evolution, and can be used to help con- 
strain the dark energy model. Cross-correlation of the lensing potential with the temperature 
can probe cur vature an d d ark energy via the effect on the large-scale integrated Sachs- Wolfe 



(ISW) effect ()lOOl : llOlt 1971 : Il02l : Il04 ). Further information can be extracted by using probes 



that are s ensit ive to somewhat different redshift ranges, for example galaxy weak lensing sur- 



veys (|26l : Il3lh . Since the growth of large-scale perturbations is sensitive to the dark energy 
sound speed, cross-correlation studies may also provide constraints on the dark energy cluster- 
ing. Unfortunately cosmic variance dominates on the largest scales where the cross-correlations 
are significant, so the precision of the constraints is limited. 

More direct constraints on the dark energy come from the angular and growth function effects 
on the lensing potential power spectrum (jlSSi ). The effect of massive neutrinos and details 
of the dark energy model (such as the equation of state) can be separated by diffe rences in 
the scale-dependence of their effect on the lensing potential power spectrum ()l56t ). This is 
illustrated for a flat universe in Fig. ^[ which shows the fractional change when the equation 
of state parameter w and neutrino mass are varied, but keeping the angular diameter distance 
to last scattering flxed. The flducial model is a cosmological constant model {w = —1) with 
massless neutrinos. These effects are, however, very nearly degenerate if only the lensed CMB 
power spectra are used: there is no angular scale for which the dominant contribution is from 
the large-angle lens es fo r which the effects of massive neutrinos and changes in the dark energy 
model are distinct ()ll4l ): see Fig. ^1 Lensing potential reconstruction (or a full non-Gaussian 
likelihood analysis) is required to provide good non-degenerate constraints. 



11.3 Cluster and galaxy mass 



In Section HI we discussed the effect of lensing on the statistics of the temperature power 
spectrum. On small scales the CMB has very little power, and what power there is comes 
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Fig. 18. Effect on lensing potential power spectrum (top) and lensed CMB spectra (bottom) of 
varying the neutrino mass (assumed the same for ah three famiUes) and equation of state of dark 
energy at fixed angular diameter distance to last scattering in a flat model. The error boxes are 
cosmic variance errors and ignore non-Gaussianity for the lensed CMB spectra. 

mainly from the lensing signal from large scale structure (and other second-order effects). 
With very dense clumps of matter it may be possible to detect them individually via their 
lensing effect, hence the idea of using CMB lensing to learn about cluster and galaxy mass. 
Since lensing is sensitive to the potential gradient, lensing constrains the projected mass 
distribution directly rather than most other observational signatures such as thermal SZ which 
are only sensitive to the baryonic component. 

The small-scale unlensed CMB is very smooth due to diffusion damping, and can be locally 
approximated by a gradient. Lensing makes photons appear to originate further from the 
centre of the cluster than they actually do, so the cold side of the gradient will look hotter after 
lensing, and the hot side will look colder. This gives a distinctive 'wiggle' in the temperature in 
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Fig. 19. Simulated effect of cluster lensing on the CMB temperature. Left: the unlensed CMB; middle: 
the lensed CMB; right: the difference due to the cluster lensing. The cluster is at redshift one, and has 
a spherically sym metric NFW profile with mass of m2oo = W^^ Mq and concentration parameter 
c = 5 (see Refs. (jl6Cll : Il62h ). Distances are in arcminutes, and can be compared to the cluster 
virial radius of 3.3arcmin. In the middle figure note the direction of the gradient inverts inside the 
~ 1 arcmin Einstein radius where the lensing deflections cross the centre of the cluster. This is a 
rather clean realization, in general the dipole pattern can be weaker and/or more complicated. 

a direction aligned with the direction of the background CMB gradient. The idea is therefore 
to model these lensing wiggles in th e o bserved temper ature (and polarization) in an otherwise 
smooth unlensed background (HH El lieO; Eel ES). For a cluster deflection field a(r), the 



wiggle due to the lensing is given simply by Q!(r) • VG in the gradient approximation. A 
particular example of the cluster lensing effect is shown in Fig. IT^ 

The rms temperature gradient is given by Eq. ()4.17|) and is ~ 14/iK/arcmin for standard 
models. Massive clusters can give deflections of the order of an arcminute, so the lensing signal 
can be at the ~ level. The scale of the dipole-like pattern induced by the cluster lensing 

is much smaller than the unlensed CMB, and so should in principle easily be observable for 
observations with high enough resolution and low enough noise. However the signal depends 
on the background gradient, so only clusters in front of a significant gradient can have their 
mass constrained this way. 

Note that in the central region of the cluster deflection angles can cross the centre, where 
the lensing is usually called strong lensing. The radius at which the deflections meet at a 
point in the centre is called the Einstein radius. However even in the strong lensing region the 
deflection angles are still small, and the Born approximation should still be valid for a single 
thin lens, so this region can be handled in just the same way as the weak lensing signal at 
larger radii. 

If the background gradient could be measured cleanly away from the cluster, the cluster 
deflection angles could be reconstructed directly from observations, and hence used to solve 
for the cluster profile and mass given certain assumptions. Unfortunately the situation is more 
complicated because the unlensed CMB isn't exactly a gradient: the background gradient 
cannot be measured reliably far from the cluster due to small scale power in the CMB (as 
well as the effect of lensing by the tails of the cluster). Fortunately the problem is statistically 
straightforward if we take the unlensed CMB field to be Gaussian: the correlation function of 
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the observed temperature should just be given by the correlation function at the undeflected 
position on the last scattering surface. We can therefore work out the likelihood of any given 
cluster deflection field a.{0) for some set of cluster parameters 6 using 

-2 log P{ct{9)\@) = e{xi)C'\xi + ai,Xj + aj)e{xj) + log\C{xi + ai,Xj + aj)\ (11.1) 



in the absence of noise ()162i ). Here C is the covariance matrix given by the correlation function 
of the unlensed field, and noise can easily be included using the noise correlation function for 
the observed positions. The effect of lensing by other perturbations along the line of sight 
can be crudely modelled by using the lensed power spectrum, though a more careful analysis 
would require a study of the non-Gaussian distribution numerically using simulations. 

This picture is complicated further by other secondary signals that cannot easily be dis- 
tinguished from the lensing signal. The effect of inhomogeneous kinetic SZ has been inves- 
tigate d from nu merical simulations, and found to be a major source of confusion in many 
cases JlelEeil)- Even in idealized cases the kinetic SZ signal can be important. For example 



if the cluster is rotating there will be a dipole-like signature th at lo oks very like that expected 
from CMB lensing, and hence acts as a source of confusion (|l62| ) (though not aligned with 
the background gradient direction). The Ostriker-Vishniac and kinetic SZ signals from inho- 
mogeneous reionization can also greatly increase the small scale power in the unlensed CMB, 
adding an effective additional correlated noise. The moving lens effect (se e Section . I11.4j) also 
introduces a dipole-like signal at the few fxK level for massive clusters ()l64l : llGSi l. However 
unlike the static lensing signal this should be uncorrelated with the direction of the back- 
ground CMB gradient, and is generally subdominant to fluctuations in the kinetic-SZ signal. 
Taken together these issues make it very challenging to obtain useful cluster constraints from 
the CMB temperature, and certainly in the near future galaxy lensing is likely to be a much 
better way to constrain cluster masses (|l62| ). 

If information about the mass of lensing clusters can be obtained by other means (e.g. using SZ 
observations with some assumptions), CMB lensing studies of many clusters can in principle 



be used to constrain cosmological parameters (|l63l ). The complicating signals like kinetic-SZ 



would have to be distinguished, for example by using their cor relation to the frequency- 



dependent thermal-SZ effect and different morphology (|l63l : Il6l| ). Residua l sig nals degrade 



the cluster lensing measurement and this is likely to be difficult in practice (jl5( 

We might also want to a ttempt a general reconstruction of the projected density field of the 
cluster profile. Ref. (Il66l ) suggest a modification to the quadratic estimator potential recon- 
struction methods of Section [7| they claim is better on cluster scales, and obtained constraints 
on cluster masses by averaging over many clusters. 

If there were sufficient small-scale structure in the CMB on cluster scales, e.g. due to secondary 
effects behind the cluster, these could also be used to measure the shear induced by the lens 
and hence used to constrain cluster mass profiles ()l22| ). However this is likely to be difficult 
in practice as it requires very high resolution observations. 



Cluster lensing techniques can in principle also be extended to galaxy CMB lensing ()l67f l. 
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though the signal is much smaller (tenths of a micro-Kelvin) and arcsecond resolution is 
required. Cross-correlation of the CMB lensing signal with other sources can help to constrain 
cosmological parameters or l earn about the source distributions, for example correlation with 
the far- infrared background ()l68| ). 



11.3.1 Cluster lensing of polarization 

Cluster lensing of the CMB polarization ()l62[ ) is conceptually very similar to the temperature 
lensing, with potentially two new gradients in Q and U (only correlated at the 10% level with 
the temperature gradient). The rms gradient of the Stokes' parameters is ~ l/xK/arcmin, so 
a signal ~ 1/iK is expected from massive clusters. Detection therefore requires noise levels 
about ten times lower than for the temperature. If the temperature signal could be used at 
this sensitivity level the temperature constraints would always be much better. However, as 
mentioned above, the temperature signal is complicated by confusion due to kineti c SZ (and 
other signals ). By contrast the polarization signal is expected to be relatively clean ( 1691 : 



8ll : llTlt Il72l ). Since there are both Q and U Stoke's parameters there are also effectively two 
random gradients to play with, so polarization cluster constraints suffer less from random 
variations in the gradient being lensed. Furthermore lensing of a temperature gradient cannot 
be used to detect deflections orthogonal to the gradient, but with the two Stokes' parameter 
gradients this degeneracy can in general be broken: polarization cluster lensing should therefore 
be useful to improve general profile reconstruction. 

In a fixed fiat sky basis, using the gradient approximation, some deflection field ol{y) gives 
the lensed polarization field 

P(r) = P(r) + a(r) ■ VP, (11.2) 

where as usual P = Q + iU . A constant gradient is neither E nor P, because making the 
E/B decomposition locally requires taking two gradients (see Section. Using the definition 
of the harmonic components in Eqs. ()5.22|) . and assuming a suitably well behaved deflection 
field, the lensed components are 



E{\) = -ail) ■ (cos(20i) VQ + sin(20i) Vt/) 

P(l) = -a(l) ■ (- sin(20i) VQ + cos(20i) Vt/) . (11.3) 

At a point the gradients of Q and U are uncorrelated, and averaging over (assumed Gaussian) 
background polarization gradients we get 



(i^(i)p)vp=^i?^i«(i)r 

(l^(l)P)vP = ^i?^|«(l)P, (11.4) 

where = (IVQP) = (|Vf/p) as in Eq. ()5.37|) . An arbitrary deflection field therefore gives 
identical E and B modes on average when lensing a pure gradient field. Taking the average 
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over a Gaussian deflection fleld gives the result we found previously in Eqs. ()5.35|) and ()5.36|) . 
If the deflection fleld has circular symmetry (as from a spherical cluster), the angular average 
of the E and B mode power are equal for any flxed polarization gradient: 



i^/d0i|^(i)r = ^(|vgr + |vt/r 

/ d0.|fi(l)p = ^ (I Vgp + I Vt/r) . (11.5) 

Clust er le nsing therefore generates equal amplitude E and B in the gradient approxima- 
tion ()l62f l. However there is very little power in the unlensed E oi B polarization on cluster 
scales, so the lensed E contains almost as much information as the lensed B: the E/ B decom- 
position is not as useful as on larger scales. If the full likelihood function is used there is in 
fact no need to use E and 5, the correlation function can be calculated directly for the Q and 
U Stoke's parameters, and the likelihood calculated using the analogue of Eq. ()11.1|) . However 
for nearby large clusters the B mode signal may allow the cluster signal to be distinguished 
from unlensed CMB 'noise', so unlike in the temperature case cluster lensing may not be CMB 
noise limited for large cluster sizes. 

Observations of CMB lensing by clusters can be used to constrain the masses of clusters at 
any redshift. For the bulk of clusters at red shift 2; ^ 1 mass measurements using lensed galaxy 
shears are however likely to be much better ()l62l ). even allowing for very optimistic future CMB 
observations. For high-redshift clusters, for which there are very few source galaxies that can 
be used for shear measurements, CMB lensing may however be a better way to constrain the 
mass if high sensitivity measurements of a clean polarization signal are available. The largest 
identifled sources of confusion are likely to be re-scattering from anis otropic thermal-SZ giving 
a frequency-dependent cluster polarization sig nal < n.7uK dlTilTTi . and cluster-sc attering of 



the CMB quadrupole giving a frequency-independent sig nal< 0.1/iK f|T73IT75[ Il76t ). However 



only dipole-like components should confuse the lensing signal of the cluster mass. 



11.4 Moving lenses and dipole lensing 



If a lensing mass is moving with respect to the CMB it corresponds to an evolving potential 
well that induces anisotropies in a manner similar to the ISW and Rees-Sciama effects. If a 
lens is moving transverse to the line of sight, a photon passing the front side of the lens (with 
respect to the direction of motion) will see a weaker potential on its way into the lens than on 
the way out, and hence receives a net redshift. Similarly a photon on the other side will receive 
a net blue shift. The moving len s therefor e induces a dipole-like temperature anisotropy in the 



photons crossing the lens' path (|16J;|177D. For small transverse 3- velocity (natural units). 



the temperature anisotropy induced by a small moving lens is at lowest order 

A0 



e 



-2 dxv±- = ■ S/3, (11.6) 
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where the integral is along the photon path, is transverse to the line of sight and 5/3 is 
the deflection angle of the photon at the lens (related to the observed deflection angle o: by 
the ratio of the relevant angular diameter distances). The signature from a cluster is therefore 
rather similar to that coming from the lensing of the background CMB temperature gradient, 
with the difference that here the dipole pattern is aligned with the transverse velocity of the 
cluster rather than the background gradient. For a IO^^H'^Mq mass cluster moving transverse 
to the line of sight at 600kms~^ the amplitude of the signal is ~ 5 x 10~^, corresponding to a 
signal of ~ l/iK. It is therefore generally subdominant to the effect of lensing of the background 
CMB gradient. 

As described above in the rest frame of the CMB the moving lens contribution is not really a 
lensing effect, but a contribution to the Rees-Sciama effect (1821) (the other contributions being 
due to the rest-frame change in the lensing mass' potential with time). However in the rest 
frame of the lens the moving lens effect can be interpreted as CMB dipole lensing, as follows. 
The velocity of the lens relative to the rest frame of the CMB induces a dipole in the lens' 
frame so that the temperature coming from direction r is B(l + v ■ r) at lowest order. In its 
rest frame, the static lens deflects the photon by an angle dp perpendicular to the line of sight 
n, so the observed temperature along this direction is 0[1 + v ■ (fi + 5/3)]. Transforming back 
to the original frame, we have a temperature anisotropy A0 = Bv^ ■ 5/3. On the back side the 
photons are deflected from the hot side of the dipole so they are hotter that they would have 
been without the deflection. On the other side they are colder. This temperature anisotropy 
is exactly the same pattern we calculated above, so dipole lensing is an equally valid way to 
interpret the signal. 

What happens if we change our velocity to be comoving with the lens? In this frame the 
lens is at rest, but our observed CMB has a dipole. We might therefore think that we could 
compute VG ■ V^/^ to get the contribution from CMB lensing by the cluster. This however 
gives a different answer in general: the lensing potential ip depends on the distance to the 
cluster and source via the angular I — x/x* factor, whereas the effect we computed above 
is independent of distance. This is essentially because the relevant dipole is the one seen by 
the lens, not the one seen by us. It is therefore not valid to consider the dipole due to our 
local motion as a dipole source at last scattering that is then deflected by the lens ()l78| ). The 
velocity dipole source must be taken to be at infinite distance to get a consistent answer. The 
primordial dipole at last scattering is however lensed, and is physically distinct from the local 
motion-induced dipole (which is also associated with other higher order signals arising from 
the transformation from the CMB frame (js^: see Section 

Typical peculiar velocities are O{10~^c), and the moving- lens signature could in principle be 
used to measure the transverse componen t. However th is may well be impossible in practice 
due to confusion with other larger sig nals (Uli Eel Il80( ) . The moving- lens signal itself is also 



a potential source of confusion when trying to measure cluster properties from CMB lensing 
as it gives a similar dipole-like pattern, albeit uncorrelated with the background gradient. 

There will also be some local effect due to our local mass distribution m oving relative to the 



CMB, giving a local contribution to the large-scale CMB anisotropies ()l8l[ l. This effect is 
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however probably very small ()178[ ). 

An interesting example of a moving-lens effect is the signal generated by a ny co smic strings 
that may be present within our horizon volume (for a review see e.g. Ref. (11821 1). A cosmic 
string with tension (energy per length) /i alters the surrounding geometry by a deficit an- 
gle SttGh- This gives an effective rest-frame deflection angle 6(3 = ArcGfi for lines of sight 
perpendicular to the string. The deflection angle is small for models consistent with current 
observations (5/5 < 10~^) and likely to be significantly smaller, so the signal from lensing 
of CMB anisotropies is small. However for strings moving close to the speed of light, this 
should induce temperature fluctuations A9/9 ~ AnGfi wi th a step- like discontinuity across 
the string. This is called the (Gott-)Kaiser-Stebbins effect ( 1831 : 184) and acts in addition to 



the signal generated from string-induced perturbations at last scattering. The Kaiser- Stebbins 
effect can also be understood in the CMB frame in terms of integrated anisotropy from the 
scalar, vector and tensor perturbations that the moving string generates. It may be possible to 
detect this cosmic-s tring imprint in future high-resolution data using its distinct non-Gaussian 
character; see Ref. (jl85|) for a targeted search and constraints with WMAP data. 

Finally, if a lens is moving along the line of sight the deflection angle calculated in the rest 
frame acquires a 0{\ ■ ii/c ) cor rection when transformed back to the CMB frame. This is 
generally totally negligible (|l86[ ). and can be consistently neglected in any lowest-order weak 
lensing calculation for O{10~^c) velocities. 



12 Summary and outlook 



It is a general prediction of metric theories of gravity that gravitational lensing must occur. It 
is therefore a robust prediction that the CMB must be lensed by potential gradients along the 
line of sight. As we have seen, this weak lensing of the CMB has several important observable 
effects, most of which are now well understood theoretically in standard models. In summary 
the lensing: 

• Perturbs the observed position of the points on the last scattering surface without changing 
the frequency dependence so the spectrum remains blackbody. 

• Can be modelled accurately using an integrated lensing potential that is only weakly sen- 
sitive to non-linear structures on scales above tens of arcminutes. 

• Smooths out the acoustic oscillations in the temperature, ii^-polarization and cross-power 
spectra by several percent. 

• Generates B-mode polarization at the fiK level by lensing of ii^-polarization. This is a 
potential source of confusion for detecting primordial gravitational waves. A significant 
fraction of the lens-induced i?-modes come from small-scale lenses and the non-linear density 
field introduces ~ 10% corrections to the B-mode power on all scales. 

• Produces a distinctive small-scale non-Gaussian trispectrum (four-point function), with 
smaller signals in the higher even n-point functions. 

• Introduces no non-Gaussian signals in the small-scale odd n-point functions if small-scale 
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correlations between the lensing potential and e.g. kinetic-SZ are neglected. 

• Introduces a large-scale non-Gaussian bispectrum (three-point function) due to correlation 
between the lensing potential and large-scale temperature. 

• Can in principle be delensed by using high-resolution polarization observations to recon- 
struct the lensing potential and re-mapping points to the last scattering surface. 

• Can be used to improve constraints on some cosmological parameters that are poorly con- 
strained by the CMB power spectra alone, such as the dark energy model and sub-e\^ 
neutrino masses. 

• Can be simulated accurately in the Gaussian limit, and also from iV-body simulation for 
more realistic small-scale analyses. 

• May be able to constrain the mass distribution of high-redshift clusters from their lensing 
signature observed at high-resolution and sensitivity. 



At the time of writing there is no detection of the lensing signal, though this is expected to 
change fairly soon. CMB observations are already at the level of precision where the lensing 
effect on the temperature power spectra must shortly be included for an accurate analysis. This 
will certainly be the case once data from the future Planck satellite , currently scheduled for 
launch in early 2008, becomes available (|l3J). Polarization experiments already taking data, 
most notably QUAD , have the sensitivity to detect the -B-modes due to lensing after ~ 
one year of integration. The next generation of polarization experiments targeting 5-modes 
from gravitational waves, such as Clover and QUIET , are being designed so that lens- 
induced 5-modes are the limiting source of confusion for detecting primary 5-modes from 
gravity waves (assuming foreground subtraction is efficient). For such instruments, lensing 
reconstruction and delensing will start to become beneficial. Proposed future missions such 
as the Infiation Probe (CMBpol) would be able to measure the lensed i?-mode signal at 
high precision, and delensing (or a full likelihood analysis) will become essential to place good 
constraints on primordial gravity waves. 



Although CMB lensing is generally well understood, there remain areas for further work. In 
particular the delensing methods for subtracting out the i?-mode lensing signal have so far only 
been developed in idealized cases (fiat sky without boundaries, simple noise and beam proper- 
ties, no non-linear structures, etc.). There are also no known general methods for performing a 
full accurate parameter likelihood analysis on lensed data once simple approximations are no 
longer good enough. Such methods may be required to obtain accurate parameter constraints 
from future high-resolution measurements of the E (and B) polarization. In principle a near- 
optimal likelihood analysis would additionally be able to constrain the gravity waves without 
a separate explicit delensing step. Analysis of the observed temperature field on small scales 
will require accurate modelling of kinetic-SZ and other non-linear signals, including correla- 
tions with the lensing signal. It may also be important to account for lensing in non-standard 
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models, for example when trying to find matched circles from non-trivial topologies. If the 
primordial fluctuations turn out to be significantly non-Gaussian or anisotropic this will also 
significantly complicate the full lensed analysis and require much further work. 
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